PREFACE

Thirty years ago Professor Charles E. Spearman introduced the factor 
problem in psychology when he observed that the intercorrelations of a set 
of tests revealed an underlying order. He interpreted this order as the effect 
of a conspicuous factor that was common to all of the tests. There has been 
much controversy about different aspects of Spearman's single-factor hy- 
pothesis and about his single-common-factor methods of analyzing inter- 
correlations. His single-common-factor hypothesis is that the intercorrela- 
tions of a group of tests can be explained in terms of a single central intellec- 
tive factor which has been denoted "g," and that the variance of each test 
can be explained by the "g" factor and a factor that is specific and unique 
for each test. Since his hypothesis involves a factor which all of the tests 
have in common and a factor which is unique for each test, it is frequently 
called a "two-factor hypothesis." Spearman's single-factor methods are con- 
cerned with the isolation of a single common factor in each test battery 
which can be so analyzed. 

Another interpretation of the mental abilities is that of Professor E. L. 
Thorndike, who has been a leader in this type of psychological research. It 
has been his judgment that the socially significant mental abilities are nu- 
merous and discrete. 

The factor methods described in this volume are based on the assumption 
that a test score can be expressed, in first approximation, as a linear func- 
tion of a number of factors. My previous papers on the multiple-factor 
problem are as follows:

"Multiple Factor Analysis," Psychological Review, XXXVIII, No. 5 (September,
1931), 406-27.

"A Multiple Factor Study of Vocational Interests," Personnel Journal, X, No. 3 
(October, 1931), 198-205. 

"Isolation of Blocs in a Legislative Body by the Voting Records of Its Members," 
Journal of Social Psychology, III, No. 4 (November, 1932), 425-33. 

Theory of Multiple Factors (January, 1933). Pp. 65. University of Chicago Book- 
store. 

Computing Diagrams for Tetrachoric Correlation Coefficients (April, 1933). Pp. 57. 
University of Chicago Bookstore. 

A Simplified Multiple Factor Method (May, 1933). Pp. 25. University of Chicago 
Bookstore. 

"The Vectors of Mind," Psychological Review, XLI, No. 1 (January, 1934), 1-32. 

"Unitary Abilities," Journal of General Psychology, XI, No. 1 (July, 1934), 126-32. 

The fundamental equation in my first paper on factor theory is the same
as the first equation in the present volume, but the development here pre- 
sented is more formal and considerably extended. The fundamental as- 
sumptions and the corresponding theorems are given in chapters i and ii. 
The centroid method which was described in my first paper has been improved
several times, and it is presented in chapter iii. The notation has
been made more explicit and unambiguous. The fundamental factor theo- 
rem was first stated in Theory of Multiple Factors. In the present volume 
this theorem is the subject of chapter ii. The theorem states that the num- 
ber of linearly independent common factors in a battery of tests is the rank 
of their reduced correlational matrix. 

In Spearman's special case, where only one common factor is involved, 
the rank of the correlational matrix must therefore be one. Hence all sec- 
ond-order minors must vanish. The expansions of the second-order minors 
are, in fact, Spearman's tetrad differences. This case is discussed in chap- 
ter v. We are concerned here with the generalization of the factor problem 
to n dimensions. 

The geometrical formulation of the factor problem which was described 
in my earlier papers has been reproduced in this volume. Each test may be 
regarded as a radial vector in a common-factor space of as many dimensions 
as there are common factors in a test battery. The correlation between any 
pair of tests is the scalar product of the test vectors. Since the scalar prod- 
uct of a pair of vectors is independent of the co-ordinate system, it follows 
that the intertest correlations define the test configuration in a common- 
factor space but that they do not define the co-ordinate system. But the 
co-ordinate axes are the scientific categories in terms of which the tests are 
to be comprehended. This is an interesting indeterminacy. One of the prin- 
cipal problems of factor analysis is to find a unique set of co-ordinate axes, 
either orthogonal or oblique, which shall represent scientifically meaningful 
categories in terms of which the tests may be comprehended. This problem 
has been solved in terms of what I have called "simple structure" of a trait 
configuration. This concept is developed in chapters vi and vii. 

One of the important restrictions that must be satisfied by any acceptable 
solution to the factor problem is that the factorial description of a trait or 
test must be invariant when it is moved from one battery to another. No 
form of uniqueness can be scientifically meaningful which violates this prin- 
ciple. This is the reason why I have discarded one of my earlier solutions, 
namely, the principal axes of the configuration; and it is also the reason why 
Professor Harold Hotelling's special case of the principal axes solution must 
be discarded. His special case has been called a method of principal com- 
ponents. The principal axes are discussed in chapter iv. 

In some applications of factor theory it seems appropriate to impose the 
restriction that the direction cosines of each trait vector shall be positive or
zero. This special case of the factor problem is developed in chapter viii on 
"The Positive Manifold." In some applications it may be appropriate to 
impose the restriction that the fundamental categories or reference traits 
shall be uncorrelated in the experimental population. The reference vectors 
are then orthogonal, and the determination of an orthogonal simple struc- 
ture is then demanded. Orthogonal transformations of possible use for this 
problem are described in chapter ix. 

Finally, when the common factors are known, it is of interest to appraise 
each individual member of the statistical population as regards each of the 
primary factors. The solution of this regression problem is given in chap- 
ter x. The other regression involves the prediction of a test performance in 
terms of the test coefficients and primary factors. The solution to this re- 
gression problem is also given in chapter x. 

Since I was myself unfamiliar with matrix theory until very recently, I 
could hardly take this subject for granted in writing for other psychologists 
with limitations of training that are similar to mine. It was therefore im- 
perative to supply students of factor analysis with a mathematical intro- 
duction to matrix theory and related topics. This seemed all the more nec- 
essary in view of the fact that the available textbooks on this subject are 
unsatisfactory. In the "Mathematical Introduction" I have attempted to
present the essential mathematical ideas as clearly as may be possible in the 
scope of a single chapter. The introduction is written for students who have 
had the conventional undergraduate instruction in analytic geometry and 
in the calculus. It is explicitly limited to the real case, since complex num- 
bers and imaginaries have not yet been introduced in factor analysis. 

One of the turning-points in the development of multiple-factor analysis 
was the discovery in 1931 that the mathematics most adaptable to this problem
was matrix theory. I once asked Professor Gilbert A. Bliss how to factor
a correlation table, but I did not call it a "matrix." He suggested that
matrix theory might be applicable to my problem, but I was entirely un- 
familiar with this branch of mathematics. Since that time I have profited 
on numerous occasions by the generosity of the members of the Department 
of Mathematics at the University of Chicago. I appreciate especially the 
interest of Professor R. W. Barnard. He suggested the equation by which a
simple structure can be represented. A "simple structure" may be regarded
either as a combined configuration of test vectors and reference vectors or 
as the aggregate of co-ordinate hyperplanes. Professor Barnard has also 
made valuable suggestions in connection with the problem of determining 
the co-ordinate hyperplanes of a simple structure by successive approxima- 
tion in the analytical method. 

I owe a special acknowledgment to my tutor in mathematics, Mr. Patrick 
Youtz, who assisted me in becoming familiar with the elements of matrix
theory. Later I was fortunate when Mr. Youtz accepted a full-time assign- 
ment on the factor projects. He has examined, mathematically, each of the 
factor methods, and he has read and criticized in detail the manuscript for 
this volume. It is through him that I have become acquainted with some 
of the conventions of mathematical writing. 

Special acknowledgment is due Miss Leone Chesire, who has been re- 
sponsible for most of the computing on my factor studies during several 
years. I have relied constantly on Miss Chesire's competent work in testing 
the many leads that we have investigated in factor theory. Her careful 
criticism of the entire manuscript has been of great value, and she has pre- 
pared the appendix on the centroid method. 

I am indebted to my colleagues, Professor Mortimer J. Adler, Professor 
A. C. Benjamin, and Professor C. W. Morris, for reading and criticizing the 
general sections of the first chapter. 

The entire manuscript for this volume has been read and criticized in
detail by four readers. Mr. Patrick Youtz and Miss Leone Chesire have 
read the manuscript and shared in the supervision of the computing. My 
wife, Thelma Gwinn Thurstone, has read and criticized the manuscript both 
for the mathematical and the psychological content. Mr. Joseph Novak, as 
a mathematician, has read and criticized the manuscript without previous 
familiarity with the factor problem. All of these readers have suggested 
many revisions that were intended to clarify the exposition, but I assume 
responsibility for all of the solutions, as well as for any errors that may be 
found. It cannot be hoped that this volume will be free from errors, since 
all of the chapters, except chapter v, cover new ground. I am indebted also 
to Mrs. Cypra Feinsot, who has supervised the work of preparing the manu- 
script for the publishers. 

Three studies are now in progress which involve applications of the factorial
methods. These will appear eventually in monograph form. They
are (1) a factor analysis of sixty psychological tests that were taken by two 
hundred and forty college students who volunteered fifteen hours of testing, 
(2) a factor study of several hundred personality traits on which thirteen 
hundred adults were rated, and (3) a factor study of vocational interests of 
three thousand college students with respect to eighty professions. 

In carrying out these theoretical investigations, as well as the practical 
applications, the Social Science Research Committee at the University of
Chicago has been most generous. I wish to acknowledge especially the interest
of Professor Donald Slesinger, chairman of the Committee. I am
grateful for the financial assistance and for the physical facilities that this
Committee has placed at my disposal during the past four years. I am grateful
for a grant by the Carnegie Corporation of New York by which it became
possible to add several research assistants during the past year. This grant 
has considerably aided in the development of factor theory. I am also 
grateful to the Illinois Emergency Relief Commission for assigning relief 
funds to these studies. Twelve computers have been at work on these fac- 
tor projects. 

The future development of factor theory will probably reduce factor 
analysis to simpler computing methods. The linear approximations that are 
here used may eventually prove to be inadequate, but it is likely that much 
can be accomplished by these approximations in psychology and in other 
social sciences. The factor methods may be regarded as an intermediate 
stage in the development of science. No one would think of investigating 
the fundamental laws of classical mechanics by correlational methods or by 
factor methods, because the laws of classical mechanics are already well 
known. If nothing were known about the law of falling bodies, it would be 
sensible to analyze, factorially, a great many attributes of objects that are 
dropped or thrown from an elevated point. It would then be discovered 
that one factor is heavily loaded with the time of fall and with the distance 
fallen but that this factor has a zero loading in the weight of the object. 
The usefulness of the factor methods will be at the borderline of science. 

No attempt has been made in this volume to integrate the present multiple-factor
analysis with the previous work of Professor Sewall Wright on
path coefficients and with the work of Professor Truman L. Kelley on mul- 
tiple factors. While these several approaches to the problem seem to be 
quite different, it should be possible to unify them. As far as I am aware, 
my own work is not in conflict with the work of others on the multiple-factor
problem or with that of Professor Spearman on the single-factor methods.
The development of factor theory, as well as its applications in science,
will be accelerated by the assistance of mathematicians; and it is gratifying 
that Professor E. B. Wilson has turned his attention to these problems in 
several papers. The future development of factor analysis in psychology 
will probably require more mathematical competence than we can supply
in our own ranks.

L. L. THURSTONE

CHICAGO, ILLINOIS 
March, 1935 
MATHEMATICAL INTRODUCTION

The matrix theory which is used in the development of factor analysis is 
not generally available to students whose training in mathematics is limited 
to undergraduate courses in analytical geometry and in the calculus. This 
mathematical introduction reviews the elementary theory of matrices as 
well as the closely related theory of determinants. Summation that involves 
double subscript notation is included in this section, since it is used in factor 
theory and since it is unfamiliar to most students of statistics. In the geo- 
metrical interpretation of the factorial matrix, only non-homogeneous co- 
ordinates are used. For this reason, the introduction includes non-homo- 
geneous co-ordinates and omits homogeneous co-ordinates which are con- 
ventional. Orthogonal and oblique transformations have been illustrated 
geometrically. No provocation has been found so far in factor theory to in- 
troduce imaginaries and complex numbers, but the future development of 
factor analysis may call for them. This mathematical introduction is limited 
to the real case, and all theorems have been written with this restriction in 
mind. 

If this introduction is not self-sufficient, perhaps it may serve as a useful 
guide to the student of factor theory who seeks mathematical assistance on 
specified topics. If a student has the intention of attaining some competence 
in factor theory and in related statistical work, there is no short cut for 
formal courses in the mathematics that is involved.* 

Matrices 

Matrices and determinants involve rectangular arrangements of numbers.
Any rectangular arrangement of numbers is called a matrix, irrespective
of what the numbers mean. If the matrix has m rows and n columns,
the matrix is said to be of order m × n. In designating the order of a matrix,
it is customary to refer to rows first and columns second. Thus a matrix of

* The following references will be found useful:

W. F. Osgood and W. C. Graustein, Plane and Solid Analytic Geometry (New York:
Macmillan Co., 1929).

L. E. Dickson, Modern Algebraic Theories (New York: B. H. Sanborn, 1926), chap. iii.

Maxime Bôcher, Introduction to Higher Algebra (New York: Macmillan Co., 1931),
chaps. i-vi, inc.

H. W. Turnbull and A. C. Aitken, Canonical Matrices (London: Blackie & Sons,
1932), chap. i.

V. Snyder and C. H. Sisam, Analytical Geometry of Space (New York: Henry Holt &
Co., 1914).

order p × q has p rows and q columns. Tables 1a and 1b show a matrix of
order 3 × 4 and a matrix of order 3 × 3. A row is horizontal. A column is
vertical. The general name for either a row or a column is an array. Each
of the small squares into which a matrix is divided is called a cell, and the
number in each cell is called a cell entry or element.

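Table 1a

\[
\begin{matrix}
2 & 3 & 1 & 5 \\
1 & 6 & 0 & 9 \\
0 & 2 & 6 & 7
\end{matrix}
\]

Table 1b

\[
\begin{matrix}
2 & 1 & 5 \\
4 & 8 & 3 \\
2 & 0 & 7
\end{matrix}
\]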

In order to designate a particular element, it is customary to use a double
subscript, the first one for the row and the second one for the column. If a
matrix is denoted A, then its elements may be denoted $a_{ij}$, where i shows
the row and j shows the column at the intersection of which the element $a_{ij}$
is found. Thus, in Table 1a the element $a_{12} = 3$ and $a_{24} = 9$.

In developing the theory of matrices it is desirable to exhibit the elements
as shown in Table 2. The elements in the first row are $a_{11}, a_{12}, a_{13},
\ldots, a_{1n}$, showing that the table represents a matrix of n columns. The elements
of the first column are $a_{11}, a_{21}, a_{31}, \ldots, a_{m1}$, showing that the table
represents m rows. The general element in this matrix A is $a_{ij}$, where i
takes the successive values 1, 2, 3, . . . , m, while j takes the successive
values 1, 2, 3, . . . , n. The first subscript refers to the row; the second subscript
refers to the column.

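Table 2

\[
\begin{matrix}
a_{11} & a_{12} & a_{13} & \cdots & a_{1n} \\
a_{21} & a_{22} & a_{23} & \cdots & a_{2n} \\
a_{31} & a_{32} & a_{33} & \cdots & a_{3n} \\
\vdots & \vdots & \vdots &        & \vdots \\
a_{m1} & a_{m2} & a_{m3} & \cdots & a_{mn}
\end{matrix}
\]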

The conventional representation of a matrix is shown in Table 3, where 
the rectangular arrangement of numbers is inclosed by double vertical lines 
on the left and on the right sides of the rectangle. It is also customary to 
denote specified matrices with letters. Thus, the matrix of Table 3 might be 
conveniently designated A or any other letter. A matrix A might also be 
designated by its general element $a_{ij}$.

Table 3

\[
\begin{Vmatrix}
2 & 1 & 5 \\
4 & 8 & 3 \\
2 & 0 & 7
\end{Vmatrix}
\]
If the successive rows of matrix A are written as successive columns of a
new matrix, the new matrix is called the transpose of A. It is denoted A'. 
Table 4 shows a matrix A and its transpose A'. 



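Table 4

\[
A =
\begin{Vmatrix}
2 & 3 & 1 & 5 \\
1 & 6 & 0 & 9 \\
0 & 2 & 6 & 7
\end{Vmatrix},
\qquad
A' =
\begin{Vmatrix}
2 & 1 & 0 \\
3 & 6 & 2 \\
1 & 0 & 6 \\
5 & 9 & 7
\end{Vmatrix}
\]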
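
The conventions introduced so far (order stated with rows first, double subscripts with the row first, and transposition) can be illustrated with a minimal Python sketch. The matrix is the one of Table 1a; the helper names `order`, `element`, and `transpose` are illustrative assumptions and do not come from the text.

```python
# Table 1a, stored as a list of rows (a matrix of order 3 x 4).
A = [
    [2, 3, 1, 5],
    [1, 6, 0, 9],
    [0, 2, 6, 7],
]


def order(matrix):
    """Return (number of rows, number of columns), rows first."""
    return len(matrix), len(matrix[0])


def element(matrix, i, j):
    """Return the element in row i and column j, counting from 1."""
    return matrix[i - 1][j - 1]


def transpose(matrix):
    """Write the successive rows of the matrix as successive columns."""
    return [list(column) for column in zip(*matrix)]


A_prime = transpose(A)
print(order(A), order(A_prime))
for row in A_prime:
    print(row)
```

Transposing twice returns the original matrix, which is a quick check on the helper.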

Determinants 

One particular interpretation of a square matrix is called a determinant. 
This interpretation of a square matrix probably had its origin in the practi- 
cal work of solving simultaneous equations, and it is indicated by single 
vertical lines on the left and on the right sides of a square table. It is illus- 
trated in Table 5. Table 3 is called a matrix; while Table 5, which implies a 
particular interpretation, is called a determinant. A determinant is always 
square. Hence its order is n, in which n is the number of rows or the number 
of columns. 

Table 5

\[
\begin{vmatrix}
2 & 1 & 5 \\
4 & 8 & 3 \\
2 & 0 & 7
\end{vmatrix}
\]

The diagonal from the upper left corner to the lower right corner of a 
determinant is called the principal diagonal. In Table 5 the principal diagonal
contains the elements 2, 8, 7. The other diagonal from the lower left
corner to the upper right corner is called the secondary diagonal.

In many problems it is convenient to assign a plus sign and a minus sign 
to alternate cells in a determinant. A convenient rule is to designate the 
upper left cell as positive and all other cells as alternately negative and 
positive, as the cells can be moved over by the castle in a chess game. This 
sign arrangement is illustrated in Table 6 for a determinant of order 5. 

Table 6

\[
\begin{matrix}
+ & - & + & - & + \\
- & + & - & + & - \\
+ & - & + & - & + \\
- & + & - & + & - \\
+ & - & + & - & +
\end{matrix}
\]
Notice that the cells in the principal diagonal are all positive and that lines
parallel to this diagonal are alternately negative and positive. The sign is 
positive for an even number of steps from the upper left cell, and it is nega- 
tive when the number of steps is odd. The sign of a cell determined in this 
manner may be called the position sign of the cell. If the general element of 
the determinant is denoted $a_{ij}$, then the element with its position sign may
be conveniently denoted $(-1)^{i+j} a_{ij}$. When the exponent (i+j) is odd, the
sign of the cell is negative; and when (i+j) is even, the sign of the cell is 
positive. 
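
The position-sign rule can be checked with a short Python sketch. The function names `position_sign` and `sign_table` are hypothetical; the rule itself is the $(-1)^{i+j}$ convention stated above, with rows and columns counted from 1.

```python
def position_sign(i, j):
    """Position sign of the cell in row i, column j (counted from 1)."""
    return 1 if (i + j) % 2 == 0 else -1


def sign_table(n):
    """Return the position signs of a determinant of order n as text rows."""
    rows = []
    for i in range(1, n + 1):
        signs = ["+" if position_sign(i, j) > 0 else "-" for j in range(1, n + 1)]
        rows.append(" ".join(signs))
    return rows


for row in sign_table(5):
    print(row)
```

For order 5 this reproduces the checkerboard arrangement of Table 6, with every cell of the principal diagonal positive.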

The product of any n elements of a square matrix, selected with only one
element from each of the n rows and only one element from each of the n
columns is called a term of the determinant of the matrix. Table 7 is a determinant
of order 3.

Table 7

\[
\begin{vmatrix}
a_{11} & a_{12} & a_{13} \\
a_{21} & a_{22} & a_{23} \\
a_{31} & a_{32} & a_{33}
\end{vmatrix}
\]

From this determinant six terms may be written. These
are shown in Table 8, in which the elements of each term are arranged in
the order of their columns.

Table 8

1) $a_{11} a_{22} a_{33}$

2) $a_{11} a_{32} a_{23}$

3) $a_{21} a_{12} a_{33}$

4) $a_{21} a_{32} a_{13}$

5) $a_{31} a_{12} a_{23}$

6) $a_{31} a_{22} a_{13}$

Each of these six terms is the product of three elements so selected that
each term contains only one element from each row and only one element
from each column. If a square matrix is of order n, the total number of
terms in its determinant is n!. The term that contains all the elements of
the principal diagonal is called the leading term of the determinant.

The sign of each of the n! terms of a determinant can be ascertained in
the following manner. Let the n elements of each term be arranged in ascending
order according to columns, as shown in Table 8. This can evidently
be done without affecting the numerical value of the terms. Consider
the fourth term as an example, and list the rows as follows:

2 3 1.
Any interchange of two adjacent elements constitutes an inversion. It may
be illustrated by interchanging 1 and 3. The resulting arrangement is 

2 1 3. 

If, now, the adjacent elements 1 and 2 are interchanged, the arrangement 
becomes 

1 2 3, 

in which the rows are in consecutive order. 

The sign of a term of a determinant is positive if it represents an even 
number of inversions from the consecutive order of rows and columns. The 
sign of the term is negative if the number of inversions in the term is odd. 

Applying this rule to the six terms of Table 8, we have the same terms
with proper signs as shown in Table 9.

Table 9

1) $+a_{11} a_{22} a_{33}$

2) $-a_{11} a_{32} a_{23}$

3) $-a_{21} a_{12} a_{33}$

4) $+a_{21} a_{32} a_{13}$

5) $+a_{31} a_{12} a_{23}$

6) $-a_{31} a_{22} a_{13}$

A complete definition of a determinant can now be given. 

Definition: If a square table is used as a symbol of the sum of n! terms,
each term being the product of n elements with only one element from
each row and only one element from each column, the sign of each term
taken positive or negative according as the term contains an even or an
odd number of inversions, then the square table is called a determinant.

Hence the sum of the six terms of Table 9 is implied by the determinant of
Table 7. The determinantal interpretation of a square matrix is denoted by
single vertical lines on the left and on the right sides of the square table, as
shown in Table 7.
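
The definition can be turned directly into a small Python sketch that forms all n! terms and signs each one by its inversions. One assumption should be noted: the code counts pairs of rows that stand out of their natural order, which has the same parity as the number of adjacent interchanges needed to restore consecutive order, so the sign agrees with the rule in the text. The function names are illustrative, and the numbers are those of Table 5.

```python
from itertools import permutations


def count_inversions(rows):
    """Count the pairs of entries that stand out of their natural order."""
    n = len(rows)
    return sum(1 for p in range(n) for q in range(p + 1, n) if rows[p] > rows[q])


def determinant_by_terms(matrix):
    """Sum the n! signed terms of a square matrix given as a list of rows."""
    n = len(matrix)
    total = 0
    for rows in permutations(range(n)):
        # rows[c] is the row that supplies the element taken from column c,
        # so the elements of each term are arranged in the order of their columns.
        term = 1
        for column, row in enumerate(rows):
            term *= matrix[row][column]
        sign = -1 if count_inversions(rows) % 2 else 1
        total += sign * term
    return total


table_5 = [
    [2, 1, 5],
    [4, 8, 3],
    [2, 0, 7],
]
print(determinant_by_terms(table_5))
```

The method is practical only for small orders, since the number of terms grows as n!.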

If a square matrix is denoted by a letter such as A, then the determinant
of the matrix is denoted |A|. If A represents a number, then |A| means
the absolute value, ignoring the sign of the number A. It should be noted
that a matrix is merely a rectangular table of numbers, and hence a matrix
has no numerical value. But a determinant is, by definition, a sum of terms,
and hence it has a numerical value. If a matrix is denoted $a_{ij}$, then its determinant
is denoted $|a_{ij}|$.
Consider the second-order determinant

\[
\begin{vmatrix}
1 & 5 \\
8 & 3
\end{vmatrix}
\]

and the 2! = 2 terms that it implies. These are 1 × 3 and 8 × 5, in which the
factors of each term are arranged in consecutive order by columns. The rows
of the term 1 × 3 are 1 and 2. Since these are in consecutive order, the sign
of this term is positive. Its value is therefore +(1)(3) = +3. The rows of
the term 8 × 5 are 2 and 1. One inversion changes the order 2 and 1 into the
consecutive order 1 and 2. Hence the sign of this term is negative. The determinant
therefore has the numerical value +3 - 40 = -37. Any second-order
determinant can be evaluated as follows:

\[
\begin{vmatrix}
a & d \\
c & b
\end{vmatrix}
= ab - cd .
\]



An x-rowed minor of the matrix A is a determinant of order x which is
formed by the intersections of any x rows and any x columns of the matrix A.
If one or more columns of a determinant are eliminated and if the
same number of rows are eliminated, the remaining cells constitute a minor.
From the determinant of Table 7 nine second-order minors may be drawn.
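A few of them are illustrated here:

\[
\begin{vmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{vmatrix}, \qquad
\begin{vmatrix} a_{12} & a_{13} \\ a_{32} & a_{33} \end{vmatrix}, \qquad
\begin{vmatrix} a_{21} & a_{23} \\ a_{31} & a_{33} \end{vmatrix} .
\]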



If any two columns and any two rows are eliminated from the determinant
of Table 7, there remains a 1-rowed minor which is a single element. In this
sense each element can be regarded as a minor of the determinant.

If corresponding rows and columns are eliminated, the remaining minor 
is symmetrically placed with regard to the principal diagonal, and it is 
called a principal minor. In the determinant of Table 7, three second-order 
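principal minors may be drawn. These are

\[
\begin{vmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{vmatrix}, \qquad
\begin{vmatrix} a_{11} & a_{13} \\ a_{31} & a_{33} \end{vmatrix}, \qquad
\begin{vmatrix} a_{22} & a_{23} \\ a_{32} & a_{33} \end{vmatrix} .
\]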



There are three 1-rowed principal minors in this determinant, namely, the 
three elements in the principal diagonal. 
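
The counting of minors can be verified with a minimal Python sketch. It enumerates every choice of x rows and x columns, and keeps only the choices with identical row and column sets when principal minors are wanted. The numerical entries of Table 5 stand in for the symbolic determinant of Table 7; the function names are assumptions made for illustration.

```python
from itertools import combinations


def det(m):
    """Determinant of a square list of rows, expanded along the first column."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for i, row in enumerate(m):
        sub = [r[1:] for k, r in enumerate(m) if k != i]
        total += (-1) ** i * row[0] * det(sub)
    return total


def minors(m, x, principal_only=False):
    """Yield (rows, columns, value) for each x-rowed minor of m."""
    for rows in combinations(range(len(m)), x):
        for cols in combinations(range(len(m[0])), x):
            if principal_only and rows != cols:
                continue
            sub = [[m[r][c] for c in cols] for r in rows]
            yield rows, cols, det(sub)


table_5 = [
    [2, 1, 5],
    [4, 8, 3],
    [2, 0, 7],
]
second_order = list(minors(table_5, 2))
principal = list(minors(table_5, 2, principal_only=True))
print(len(second_order), len(principal))
```

For a third-order determinant this gives nine second-order minors, three of which are principal.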
If the row i and the column j which intersect in an element $a_{ij}$ are eliminated
from a determinant, the remaining (n-1)-rowed determinant is
called the first minor of $a_{ij}$. This definition is illustrated with the determinant
of Table 5. The second row and the first column intersect in the element
4. If these two arrays are eliminated from the determinant, the remaining
2-rowed determinant is

\[
\begin{vmatrix}
1 & 5 \\
0 & 7
\end{vmatrix}
= 7 - 0 = +7 .
\]

This determinant, whose numerical value is +7, is the first minor of the
element $a_{21} = 4$ in the determinant of Table 5. Let the first minor of the element
$a_{ij}$ be denoted $m_{ij}$.

In some problems it is convenient to refer to the minor $m_{ij}$ with the position
sign of the element $a_{ij}$. This quantity is called the cofactor of $a_{ij}$. It is
defined by the relation

\[
(\text{cofactor of } a_{ij}) = e_{ij} = (-1)^{i+j} m_{ij} .
\]

In Table 5 the cofactor of the element 4 is

\[
(-1)^{2+1}
\begin{vmatrix}
1 & 5 \\
0 & 7
\end{vmatrix}
= -[7-0] = -7 .
\]

In the same table the cofactor of the element 3 is

\[
(-1)^{2+3}
\begin{vmatrix}
2 & 1 \\
2 & 0
\end{vmatrix}
= -[0-2] = +2 .
\]

Hence the absolute values of the first minor of $a_{ij}$ and of its cofactor are
identical. They differ only in the manner of determining the sign. If the
position sign of the element $a_{ij}$ is positive, the first minor and the cofactor
have the same sign. If the position sign of $a_{ij}$ is negative, they have opposite
signs.
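
The two definitions can be checked numerically with a short Python sketch. The helpers `first_minor` and `cofactor` are hypothetical names; they take row and column numbers counted from 1, as in the text, and are applied to the determinant of Table 5.

```python
def det(m):
    """Determinant of a square list of rows, expanded along the first column."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for i, row in enumerate(m):
        sub = [r[1:] for k, r in enumerate(m) if k != i]
        total += (-1) ** i * row[0] * det(sub)
    return total


def first_minor(m, i, j):
    """First minor of the element in row i, column j (counted from 1)."""
    sub = [
        [value for c, value in enumerate(row, start=1) if c != j]
        for r, row in enumerate(m, start=1)
        if r != i
    ]
    return det(sub)


def cofactor(m, i, j):
    """First minor taken with the position sign of the element."""
    return (-1) ** (i + j) * first_minor(m, i, j)


table_5 = [
    [2, 1, 5],
    [4, 8, 3],
    [2, 0, 7],
]
print(first_minor(table_5, 2, 1), cofactor(table_5, 2, 1))  # element 4
print(first_minor(table_5, 2, 3), cofactor(table_5, 2, 3))  # element 3
```

In both cases the position sign is negative, so the minor and the cofactor differ in sign but agree in absolute value.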

The numerical value of a determinant can be expressed conveniently for
some problems in terms of the cofactors. For example,

\[
\begin{vmatrix}
a_{11} & a_{12} & a_{13} \\
a_{21} & a_{22} & a_{23} \\
a_{31} & a_{32} & a_{33}
\end{vmatrix}
= a_{11} e_{11} + a_{21} e_{21} + a_{31} e_{31} .
\]

The numerical value of a determinant is the weighted sum of the elements in
any array, each element being weighted by its cofactor. In the example, the
determinant is expressed in terms of the elements of the first column.

As a numerical example, the value of the determinant of Table 5 can be 
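expressed as follows:

\[
\begin{vmatrix}
2 & 1 & 5 \\
4 & 8 & 3 \\
2 & 0 & 7
\end{vmatrix}
= +2 \begin{vmatrix} 8 & 3 \\ 0 & 7 \end{vmatrix}
- 4 \begin{vmatrix} 1 & 5 \\ 0 & 7 \end{vmatrix}
+ 2 \begin{vmatrix} 1 & 5 \\ 8 & 3 \end{vmatrix}
\]

\[
= 2(56-0) - 4(7-0) + 2(3-40) = 112 - 28 - 74 = +10 .
\]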

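The numerical value of a determinant can be expressed as the summation:

\[
|A| = \sum_{i=1}^{n} a_{ij} e_{ij} = \sum_{j=1}^{n} a_{ij} e_{ij} , \tag{1}
\]

where the weighted sum may be taken over any column or any row. The
following is an example of a fourth-order determinant, evaluated by the
method of (1).

\[
\begin{vmatrix}
2 & 4 & 1 & 0 \\
3 & 2 & 4 & 2 \\
1 & 6 & 1 & 4 \\
1 & 0 & 2 & 3
\end{vmatrix}
= +2 \begin{vmatrix} 2 & 4 & 2 \\ 6 & 1 & 4 \\ 0 & 2 & 3 \end{vmatrix}
- 3 \begin{vmatrix} 4 & 1 & 0 \\ 6 & 1 & 4 \\ 0 & 2 & 3 \end{vmatrix}
+ 1 \begin{vmatrix} 4 & 1 & 0 \\ 2 & 4 & 2 \\ 0 & 2 & 3 \end{vmatrix}
- 1 \begin{vmatrix} 4 & 1 & 0 \\ 2 & 4 & 2 \\ 6 & 1 & 4 \end{vmatrix}
\]

\[
\begin{vmatrix} 2 & 4 & 2 \\ 6 & 1 & 4 \\ 0 & 2 & 3 \end{vmatrix}
= +2 \begin{vmatrix} 1 & 4 \\ 2 & 3 \end{vmatrix}
- 6 \begin{vmatrix} 4 & 2 \\ 2 & 3 \end{vmatrix}
+ 0 \begin{vmatrix} 4 & 2 \\ 1 & 4 \end{vmatrix}
= 2(3-8) - 6(12-4) + 0(16-2) = -10 - 48 + 0 = -58 ,
\]

\[
\begin{vmatrix} 4 & 1 & 0 \\ 6 & 1 & 4 \\ 0 & 2 & 3 \end{vmatrix}
= +4 \begin{vmatrix} 1 & 4 \\ 2 & 3 \end{vmatrix}
- 6 \begin{vmatrix} 1 & 0 \\ 2 & 3 \end{vmatrix}
+ 0 \begin{vmatrix} 1 & 0 \\ 1 & 4 \end{vmatrix}
= 4(3-8) - 6(3-0) + 0(4-0) = -20 - 18 + 0 = -38 ,
\]

\[
\begin{vmatrix} 4 & 1 & 0 \\ 2 & 4 & 2 \\ 0 & 2 & 3 \end{vmatrix}
= +4 \begin{vmatrix} 4 & 2 \\ 2 & 3 \end{vmatrix}
- 2 \begin{vmatrix} 1 & 0 \\ 2 & 3 \end{vmatrix}
+ 0 \begin{vmatrix} 1 & 0 \\ 4 & 2 \end{vmatrix}
= 4(12-4) - 2(3-0) + 0(2-0) = 32 - 6 + 0 = +26 ,
\]

\[
\begin{vmatrix} 4 & 1 & 0 \\ 2 & 4 & 2 \\ 6 & 1 & 4 \end{vmatrix}
= +4 \begin{vmatrix} 4 & 2 \\ 1 & 4 \end{vmatrix}
- 2 \begin{vmatrix} 1 & 0 \\ 1 & 4 \end{vmatrix}
+ 6 \begin{vmatrix} 1 & 0 \\ 4 & 2 \end{vmatrix}
= 4(16-2) - 2(4-0) + 6(2-0) = +56 - 8 + 12 = +60 .
\]

Hence

\[
|A| = (2)(-58) - (3)(-38) + (1)(+26) - (1)(+60) = -36 .
\]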
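
The expansion by cofactors lends itself to a recursive Python sketch: each cofactor is itself a determinant of lower order, which is expanded in turn. The function names are illustrative assumptions, and indices inside the code are counted from 0, which leaves the parity of (i+j) and hence the position sign unchanged. The sketch also confirms that the weighted sum is the same over every column and every row.

```python
def submatrix(m, i, j):
    """Remove row i and column j (counted from 0)."""
    return [
        [value for c, value in enumerate(row) if c != j]
        for r, row in enumerate(m)
        if r != i
    ]


def cofactor(m, i, j):
    """Cofactor of the element in row i, column j (counted from 0)."""
    return (-1) ** (i + j) * det(submatrix(m, i, j))


def expand_by_column(m, j):
    """Weighted sum of the elements of column j, each weighted by its cofactor."""
    return sum(m[i][j] * cofactor(m, i, j) for i in range(len(m)))


def expand_by_row(m, i):
    """Weighted sum of the elements of row i, each weighted by its cofactor."""
    return sum(m[i][j] * cofactor(m, i, j) for j in range(len(m)))


def det(m):
    """Determinant by expansion along the first column."""
    if len(m) == 1:
        return m[0][0]
    return expand_by_column(m, 0)


table_5 = [[2, 1, 5], [4, 8, 3], [2, 0, 7]]
fourth_order = [
    [2, 4, 1, 0],
    [3, 2, 4, 2],
    [1, 6, 1, 4],
    [1, 0, 2, 3],
]
print(det(table_5), det(fourth_order))
```

Choosing an array that contains ciphers shortens the hand computation, since the corresponding cofactors need not be evaluated.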



For every element $a_{ij}$ in the square matrix A there is a corresponding
minor $m_{ij}$ and a corresponding cofactor $e_{ij}$. Let M be the square matrix
with elements $m_{ij}$; let E be the square matrix with elements $e_{ij}$; and let F be
the transpose of E. Then the square matrix F is called the adjoint of A.
Its elements may be denoted $f_{ij} = e_{ji}$.

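These definitions are illustrated in the following numerical example:

\[
\begin{Vmatrix}
2 & 1 & 5 \\
4 & 8 & 3 \\
2 & 0 & 7
\end{Vmatrix}
= A
\]

\[
\begin{Vmatrix}
56 & 22 & -16 \\
7 & 4 & -2 \\
-37 & -14 & 12
\end{Vmatrix}
= M
\]

\[
\begin{Vmatrix}
56 & -22 & -16 \\
-7 & 4 & 2 \\
-37 & 14 & 12
\end{Vmatrix}
= E
\]

\[
\begin{Vmatrix}
56 & -7 & -37 \\
-22 & 4 & 14 \\
-16 & 2 & 12
\end{Vmatrix}
= F = \text{adjoint of } A
\]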
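
The three matrices M, E, and F can be computed mechanically from A. The following Python sketch is one way to do so under the definitions just given; the function names are assumptions, and A is the matrix of Table 5.

```python
def det(m):
    """Determinant of a square list of rows, expanded along the first column."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for i, row in enumerate(m):
        sub = [r[1:] for k, r in enumerate(m) if k != i]
        total += (-1) ** i * row[0] * det(sub)
    return total


def minor_matrix(a):
    """Matrix M whose element m_ij is the first minor of a_ij."""
    n = len(a)
    return [
        [
            det([[a[r][c] for c in range(n) if c != j] for r in range(n) if r != i])
            for j in range(n)
        ]
        for i in range(n)
    ]


def cofactor_matrix(a):
    """Matrix E whose element e_ij is m_ij taken with its position sign."""
    m = minor_matrix(a)
    n = len(a)
    return [[(-1) ** (i + j) * m[i][j] for j in range(n)] for i in range(n)]


def adjoint(a):
    """Matrix F, the transpose of the cofactor matrix E."""
    return [list(column) for column in zip(*cofactor_matrix(a))]


A = [[2, 1, 5], [4, 8, 3], [2, 0, 7]]
for name, matrix in (("M", minor_matrix(A)), ("E", cofactor_matrix(A)), ("F", adjoint(A))):
    print(name, matrix)
```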
A square matrix is said to be symmetric when $a_{ij} = a_{ji}$. It is symmetric
about the principal diagonal.

\[
\begin{Vmatrix}
1 & 4 & 5 \\
4 & 2 & 8 \\
5 & 8 & 3
\end{Vmatrix}
= \text{a symmetric matrix} .
\]

If the matrix is symmetric except that the signs above the principal diagonal
are opposite to the signs below the diagonal, then the matrix is said to
be skew symmetric.

\[
\begin{Vmatrix}
+2 & -3 & +4 \\
+3 & -5 & -5 \\
-4 & +5 & +6
\end{Vmatrix}
= \text{a skew symmetric matrix} .
\]



If all the principal minors of a matrix are greater than or equal to zero,
then the matrix is said to be positive-definite. If, in addition, it is symmetric,
it is a Gramian matrix.

\[
\begin{Vmatrix}
2 & 3 & -3 \\
2 & 4 & 2 \\
3 & 5 & 6
\end{Vmatrix}
= \text{a positive definite matrix} .
\]

\[
\begin{Vmatrix}
5 & 10 & 13 \\
10 & 20 & 26 \\
13 & 26 & 36
\end{Vmatrix}
= \text{a Gramian matrix} .
\]

In some problems it is important to know the highest order of the non-vanishing
minors. The highest order of the non-vanishing minors is called
the rank of a matrix. The rank of Table 5 is equal to its order, namely, 3,
because the determinant itself does not vanish. The determinant of Table 10
does vanish, so that its rank must be less than 3. It contains second-order
minors that do not vanish, and the rank of the determinant is therefore 2.

Table 10

\[
\begin{vmatrix}
10 & 8 & 1 \\
8 & 8 & 2 \\
1 & 2 & 1
\end{vmatrix}
\]
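
Both definitions in this passage are statements about minors, so they can be checked by direct enumeration. The Python sketch below finds the rank as the highest order of a non-vanishing minor and lists the principal minors of a matrix. It is a brute-force illustration suited to small tables, not an efficient method; the function names are assumptions, and the matrices are the numerical examples of this section.

```python
from itertools import combinations


def det(m):
    """Determinant of a square list of rows, expanded along the first column."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for i, row in enumerate(m):
        sub = [r[1:] for k, r in enumerate(m) if k != i]
        total += (-1) ** i * row[0] * det(sub)
    return total


def rank_by_minors(m):
    """Highest order x for which some x-rowed minor does not vanish."""
    n_rows, n_cols = len(m), len(m[0])
    for x in range(min(n_rows, n_cols), 0, -1):
        for rows in combinations(range(n_rows), x):
            for cols in combinations(range(n_cols), x):
                if det([[m[r][c] for c in cols] for r in rows]) != 0:
                    return x
    return 0


def principal_minors(m):
    """Values of all principal minors, of every order, of a square matrix."""
    n = len(m)
    return [
        det([[m[r][c] for c in idx] for r in idx])
        for x in range(1, n + 1)
        for idx in combinations(range(n), x)
    ]


def is_symmetric(m):
    """True when a_ij equals a_ji for every cell."""
    return all(m[i][j] == m[j][i] for i in range(len(m)) for j in range(len(m)))


table_5 = [[2, 1, 5], [4, 8, 3], [2, 0, 7]]
table_10 = [[10, 8, 1], [8, 8, 2], [1, 2, 1]]
gramian = [[5, 10, 13], [10, 20, 26], [13, 26, 36]]
print(rank_by_minors(table_5), rank_by_minors(table_10))
print(principal_minors(gramian), is_symmetric(gramian))
```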
If the determinant of a matrix is zero, the matrix is said to be singular.
If the determinant does not vanish, the matrix is said to be non-singular.

If $|a_{ij}| = 0$, then $a_{ij}$ is singular.
If $|a_{ij}| \neq 0$, then $a_{ij}$ is non-singular.

Most of the theorems in the elementary theory of real determinants are 
concerned with the methods of ascertaining the numerical value of a de- 
terminant and with the operations that do, or do not, affect its numerical 
value. The following theorems are useful in dealing with determinants: 

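1) The value of a determinant is equal to that of its transpose.

\[
\begin{vmatrix}
a_1 & b_1 & c_1 \\
a_2 & b_2 & c_2 \\
a_3 & b_3 & c_3
\end{vmatrix}
=
\begin{vmatrix}
a_1 & a_2 & a_3 \\
b_1 & b_2 & b_3 \\
c_1 & c_2 & c_3
\end{vmatrix} .
\]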

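2) If any pair of parallel arrays of a determinant are interchanged, the
absolute value of the determinant remains unaltered but the sign is reversed.

\[
\begin{vmatrix}
a_1 & b_1 & c_1 \\
a_2 & b_2 & c_2 \\
a_3 & b_3 & c_3
\end{vmatrix}
= -
\begin{vmatrix}
b_1 & a_1 & c_1 \\
b_2 & a_2 & c_2 \\
b_3 & a_3 & c_3
\end{vmatrix} .
\]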



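3) If two parallel arrays of a determinant are proportional, the determinant
vanishes.

\[
\begin{vmatrix}
a_1 & k a_1 & c_1 \\
a_2 & k a_2 & c_2 \\
a_3 & k a_3 & c_3
\end{vmatrix}
= 0 ; \qquad
\begin{vmatrix}
a_1 & a_2 & a_3 \\
b_1 & b_2 & b_3 \\
k a_1 & k a_2 & k a_3
\end{vmatrix}
= 0 .
\]

If k = 1, then two arrays are identical and the determinant vanishes.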
4) If a determinant has an array of ciphers, it is equal to zero.

$$
\begin{vmatrix} a & b & c \\ 0 & 0 & 0 \\ d & e & f \end{vmatrix} = 0 .
$$



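5) If each element of an array is multiplied by any factor, the value of
the determinant is multiplied by that factor.

$$
\begin{vmatrix} ka_1 & a_2 & a_3 \\ kb_1 & b_2 & b_3 \\ kc_1 & c_2 & c_3 \end{vmatrix}
= k
\begin{vmatrix} a_1 & a_2 & a_3 \\ b_1 & b_2 & b_3 \\ c_1 & c_2 & c_3 \end{vmatrix}
$$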



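This theorem is sometimes useful in reducing a determinant to simpler
forms.

$$
\begin{vmatrix} 6 & 9 & 8 \\ 12 & 18 & 4 \\ 24 & 27 & 2 \end{vmatrix}
= 6 \times 9 \times 2
\begin{vmatrix} 1 & 1 & 4 \\ 2 & 2 & 2 \\ 4 & 3 & 1 \end{vmatrix}
= 6 \times 9 \times 2 \times 2
\begin{vmatrix} 1 & 1 & 4 \\ 1 & 1 & 1 \\ 4 & 3 & 1 \end{vmatrix}
= -648 .
$$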



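6) If each element in an array is reversed in sign, the value of the
determinant is reversed in sign.

$$
\begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix}
= -
\begin{vmatrix} a_1 & b_1 & c_1 \\ -a_2 & -b_2 & -c_2 \\ a_3 & b_3 & c_3 \end{vmatrix}
$$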



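7) If an array contains no zero elements, all the elements of the array
may be made unity by means of multiplying factors.

$$
\begin{aligned}
\begin{vmatrix} 3 & 4 & 6 \\ 2 & 8 & 8 \\ 6 & 7 & 9 \end{vmatrix}
&= 3 \times 4 \times 6
\begin{vmatrix} 1 & 1 & 1 \\ \tfrac{2}{3} & 2 & \tfrac{4}{3} \\ 2 & \tfrac{7}{4} & \tfrac{3}{2} \end{vmatrix}
= 3 \times 4
\begin{vmatrix} 1 & 1 & 1 \\ 4 & 12 & 8 \\ 2 & \tfrac{7}{4} & \tfrac{3}{2} \end{vmatrix} \\
&= 3 \times 4 \times 4
\begin{vmatrix} 1 & 1 & 1 \\ 1 & 3 & 2 \\ 2 & \tfrac{7}{4} & \tfrac{3}{2} \end{vmatrix}
= 3 \times 4
\begin{vmatrix} 1 & 1 & 1 \\ 1 & 3 & 2 \\ 8 & 7 & 6 \end{vmatrix}
= -36 .
\end{aligned}
$$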
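8) Every determinant can be expressed as a sum of two determinants.

$$
\begin{vmatrix} (a_1+p) & b_1 & c_1 \\ (a_2+q) & b_2 & c_2 \\ (a_3+r) & b_3 & c_3 \end{vmatrix}
=
\begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix}
+
\begin{vmatrix} p & b_1 & c_1 \\ q & b_2 & c_2 \\ r & b_3 & c_3 \end{vmatrix}
$$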

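9) The value of a determinant remains unaltered if each element in any
array is augmented by a multiple of the corresponding element in a parallel
array.

$$
\begin{vmatrix} (a_1+kb_1) & b_1 & c_1 \\ (a_2+kb_2) & b_2 & c_2 \\ (a_3+kb_3) & b_3 & c_3 \end{vmatrix}
=
\begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix}
+ k
\begin{vmatrix} b_1 & b_1 & c_1 \\ b_2 & b_2 & c_2 \\ b_3 & b_3 & c_3 \end{vmatrix}
$$

The second determinant in the right member vanishes because two columns
are identical.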

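10) If all the elements of a determinant on one side of the principal
diagonal are ciphers, the determinant reduces to the leading term.

$$
\begin{vmatrix} 5 & 1 & 2 \\ 6 & 3 & 3 \\ 11 & 3 & 8 \end{vmatrix}
=
\begin{vmatrix} 5 & 1 & 2 \\ 6 & 3 & 3 \\ 5 & 0 & 5 \end{vmatrix}
=
\begin{vmatrix} 3 & 1 & 2 \\ 3 & 3 & 3 \\ 0 & 0 & 5 \end{vmatrix}
=
\begin{vmatrix} 2 & 1 & 2 \\ 0 & 3 & 3 \\ 0 & 0 & 5 \end{vmatrix}
= 2 \times 3 \times 5 = +30 .
$$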
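Several of the theorems above can be checked numerically. The sketch below is an added illustration that assumes the numerical examples given with theorems 5 and 10; the helper names `det3` and `transpose` are chosen for this example only.

```python
def det3(m):
    """Determinant of a 3x3 matrix by expansion along the first row."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

def transpose(m):
    """Interchange rows and columns."""
    return [list(col) for col in zip(*m)]

m = [[6, 9, 8], [12, 18, 4], [24, 27, 2]]      # example of theorem 5
reduced = [[1, 1, 4], [1, 1, 1], [4, 3, 1]]    # after removing the factors 6, 9, 2, 2

value = det3(m)
same_after_transpose = det3(transpose(m)) == value        # theorem 1
sign_reversed = det3([m[1], m[0], m[2]]) == -value        # theorem 2 (rows 1, 2 swapped)
factored = 6 * 9 * 2 * 2 * det3(reduced) == value         # theorem 5

triangular = det3([[2, 1, 2], [0, 3, 3], [0, 0, 5]])      # theorem 10: leading term
original = det3([[5, 1, 2], [6, 3, 3], [11, 3, 8]])       # same determinant before reduction
print(value, triangular, original)
```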



Matrix multiplication 

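Consider the three simultaneous equations,

$$
\begin{aligned}
y_1 + 2y_2 &= x_1 , \\
3y_2 + y_3 &= x_2 , \\
2y_1 + 2y_2 + 4y_3 &= x_3 .
\end{aligned}
\tag{2}
$$

The equations (2) are written in the expanded notation. If the x's are known,
the equations may be solved numerically for the y's. As an example, let
$x_1 = 1$, $x_2 = 3$, $x_3 = 2$. Solving the equations simultaneously,

$$
y_1 = -\tfrac{5}{7} , \qquad y_2 = \tfrac{6}{7} , \qquad y_3 = \tfrac{3}{7} .
$$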
In some problems it is convenient, as well as clarifying, to represent a set
of simultaneous equations in the rectangular notation which is illustrated in
Table 11. Here the coefficients of the three simultaneous equations have
been arranged in the form of a matrix that may be denoted A. The y's have
been arranged in a vertical column in a matrix denoted y. The x's are also
arranged in a vertical column matrix denoted x. The matrix A is of order
3X3, while the matrices x and y are both of order 3X1.

Table 11

$$
\underset{A}{\begin{bmatrix} 1 & 2 & 0 \\ 0 & 3 & 1 \\ 2 & 2 & 4 \end{bmatrix}}
\cdot
\underset{y}{\begin{bmatrix} y_1 \\ y_2 \\ y_3 \end{bmatrix}}
=
\underset{x}{\begin{bmatrix} x_1 \\ x_2 \\ x_3 \end{bmatrix}}
$$

A matrix that consists of a single column will be called a column vector. 
A matrix that consists of a single row will be called a row vector. The vec- 
torial terminology is probably due to the fact that the elements in any array 
may be regarded as the Cartesian co-ordinates of a point in a space of as 
many dimensions as there are elements in the array. This point, together 
with the origin, determines a direction in space. In this manner any array 
of a matrix can be given a vectorial interpretation.

The three matrices A, y, x, of Table 11 may be regarded as symbolizing
the simultaneous equations (2). This necessitates that a particular operation
be implied by the adjacent matrices A and y. In order that these matrices
shall symbolize the simultaneous equations, the following rule must
be implied in Table 11:

The first equation in (2) can be produced from the matrices by writing

$$
a_{11}y_{11} + a_{12}y_{21} + a_{13}y_{31} = x_{11} .
\tag{3}
$$

In performing this operation with the matrices, a row of the first one is
associated with a column of the second one. The first equation of (2) calls
for the cross products of the corresponding elements of the first row of A and
the first column of y. (In the present problem, y has only one column.) The
cross product is illustrated by (3).

The second equation (2) is produced by performing the same row-by-column
multiplication, using the second row of A and the first column of y.
Then we have

$$
a_{21}y_{11} + a_{22}y_{21} + a_{23}y_{31} = x_{21} .
\tag{4}
$$
The third equation (2) is produced by a similar operation on the third
row of A and the first column of y. The sum of the three products is recorded
in the third row and first column of x. The equation is

$$
a_{31}y_{11} + a_{32}y_{21} + a_{33}y_{31} = x_{31} .
\tag{5}
$$

The three equations may be written in the more condensed form

$$
\sum_{j=1}^{3} a_{ij} y_{jk} = x_{ik} .
\tag{6}
$$



This interpretation of two adjacent matrices is called matrix multiplication. 
In the present problem k = 1, because y has only one column. Since i can 
take three different values, namely, i = l, 2, 3, the equation (6) represents 
all three of the simultaneous equations in summational notation. Table 11 
represents the rectangular notation. The three equations (2) may also be 
represented conveniently in the still more condensed matrix notation, 
namely, 

(7) Ay=x, 

which is a matrix equation. The operation specified by this matrix equation
is that if the matrix A is multiplied by the matrix y, row-by-column, the
matrix product is another matrix, namely, x. This is an exceedingly powerful
method of handling sets of equations, because many otherwise tedious
numerical operations can be shunted, so that the calculations are performed 
only on a final set of matrices rather than on many intermediate steps. Still 
more important is the fact that significant relations in a problem are con- 
spicuous in the matrix notation but they may be obscure when the prob- 
lem is handled in expanded algebraic or numerical form. 
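The row-by-column rule of equation (6) can be written out as a short Python sketch. This is an added illustration, not part of the original exposition; the function name `matmul` is an assumption, and the values of y are the solution of equations (2) for $x_1 = 1$, $x_2 = 3$, $x_3 = 2$, computed here with exact fractions.

```python
from fractions import Fraction as F

def matmul(a, b):
    """Row-by-column product: c[i][k] is the sum over j of a[i][j] * b[j][k]."""
    if len(a[0]) != len(b):
        raise ValueError("columns of the first matrix must equal rows of the second")
    return [[sum(a[i][j] * b[j][k] for j in range(len(b)))
             for k in range(len(b[0]))]
            for i in range(len(a))]

A = [[1, 2, 0], [0, 3, 1], [2, 2, 4]]        # coefficients of Table 11
y = [[F(-5, 7)], [F(6, 7)], [F(3, 7)]]       # column vector of order 3 x 1
x = matmul(A, y)                             # should reproduce x1 = 1, x2 = 3, x3 = 2
print(x)
```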



Table 12

$$
\underset{y'}{\begin{bmatrix} y_1 & y_2 & y_3 \end{bmatrix}}
\cdot
\underset{A'}{\begin{bmatrix} 1 & 0 & 2 \\ 2 & 3 & 2 \\ 0 & 1 & 4 \end{bmatrix}}
=
\underset{x'}{\begin{bmatrix} x_1 & x_2 & x_3 \end{bmatrix}}
$$

The same set of simultaneous equations (2) may be represented by the
matrix multiplication shown in Table 12. The multiplication of the first row
of y' (y' has only one row) and the first column of A' reproduces the first
equation of (2). It should be noted that the matrices of Table 12 are the
transposes of the matrices of Table 11. The transpose of a column vector is
a row vector with the same elements.

The summational notation for the matrix multiplication of Table 12 is

$$
\sum_{j=1}^{3} y_{kj} a_{ji} = x_{ki} .
\tag{8}
$$

The general element of A in Table 11 is $a_{ij}$. Hence the general element of A'
in Table 12 is $a_{ji}$. The general element of y is $y_{jk}$, so that the general
element of y' is $y_{kj}$. The matrix equation for Table 12 is

(9) y'A' = x' ,

which represents the same set of equations as (7).

In order to multiply one matrix by another, the number of columns of 
the first one must be the same as the number of rows of the second. The
columns of A in Table 11 are represented by the subscript j, and this is also the
subscript for the rows of y. If the subscripts for the first matrix are i and j
and the subscripts for the second matrix are j and k, then the j subscript is 
eliminated from the matrix product which has the subscripts i and k. The 
same rule can be verified in the matrix multiplication of Table 12, where the 
subscripts of the first matrix are k and j and those of the second matrix 
are j and i. Eliminating the middle subscript j, which is common, the ma- 
trix product has the subscripts k and i. 

The matrix equations (7) and (9) illustrate the following matrix theorem: 

Theorem: The transpose of any product of matrices is the product of their
transposes in reverse order.

Hence, if AB = C, it follows that B'A' = C'. Applying this theorem to the
present example, we have, by Table 11, Ay = x and, by the theorem, y'A' = x',
which is the matrix equation for Table 12.

If the x's are known in (2), then the y's may be found. Let the y's be
expressed as linear functions of the z's in (10).

$$
\begin{aligned}
z_1 + z_3 &= y_1 , \\
2z_2 + z_3 &= y_2 , \\
z_1 + 2z_2 &= y_3 .
\end{aligned}
\tag{10}
$$
This set of simultaneous equations is represented in Table 13. If the three
matrices of Table 13 are denoted B, z, and y, we can represent the three
equations (10) in the single matrix equation,

(11) Bz = y .

Table 13

$$
\underset{B}{\begin{bmatrix} 1 & 0 & 1 \\ 0 & 2 & 1 \\ 1 & 2 & 0 \end{bmatrix}}
\cdot
\underset{z}{\begin{bmatrix} z_1 \\ z_2 \\ z_3 \end{bmatrix}}
=
\underset{y}{\begin{bmatrix} y_1 \\ y_2 \\ y_3 \end{bmatrix}}
$$

Since the y's are known, the values of the z's can be determined. Substituting
the known values of the y's in (10), we find that

$$
z_1 = -\tfrac{4}{7} , \qquad z_2 = \tfrac{1}{2} , \qquad z_3 = -\tfrac{1}{7} .
$$



Equation (7) shows that the x's can be expressed linearly in terms of
the y's. Equation (11) shows that the y's can be expressed linearly in terms
of the z's. It is desired now to express the x's directly in terms of the z's
without the intermediate y's. This can be done. From the equations

(7) Ay = x ,
(11) Bz = y ,

it follows that

A(Bz) = x ,
ABz = x .

Let AB = C. Then

(12) Cz = x .



In order to express the x's in terms of the z's, the matrix product AB = C
must be determined numerically. This matrix product is shown graphically
in Table 14. Consider the first row of A and the first column of B. The cross
product is

$$
(1)(1) + (2)(0) + (0)(1) = +1 .
$$

This is therefore the element in the first row and the first column of the
matrix product C. Consider, as another example, the second row of A and
the third column of B. The cross product is

$$
(0)(1) + (3)(1) + (1)(0) = 3 .
$$

This is the element $c_{23}$ in the matrix C.



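Table 14

$$
\underset{A}{\begin{bmatrix} 1 & 2 & 0 \\ 0 & 3 & 1 \\ 2 & 2 & 4 \end{bmatrix}}
\cdot
\underset{B}{\begin{bmatrix} 1 & 0 & 1 \\ 0 & 2 & 1 \\ 1 & 2 & 0 \end{bmatrix}}
=
\underset{C}{\begin{bmatrix} 1 & 4 & 3 \\ 1 & 8 & 3 \\ 6 & 12 & 4 \end{bmatrix}}
$$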





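Since the numerical values of the x's, the y's, and the z's are known, the
matrix equation (12) may be tested graphically, as shown in Table 15.

Table 15

$$
\underset{C}{\begin{bmatrix} 1 & 4 & 3 \\ 1 & 8 & 3 \\ 6 & 12 & 4 \end{bmatrix}}
\cdot
\underset{z}{\begin{bmatrix} -\tfrac{4}{7} \\ \tfrac{1}{2} \\ -\tfrac{1}{7} \end{bmatrix}}
=
\underset{x}{\begin{bmatrix} 1 \\ 3 \\ 2 \end{bmatrix}}
$$

As a sample check, consider the second row of C and the first column of z. It
should reproduce the value 3.

$$
(1)(-\tfrac{4}{7}) + (8)(\tfrac{1}{2}) + (3)(-\tfrac{1}{7}) = 3 .
$$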



Table 14 shows the matrix product AB = C. If the order of the matrices
A and B is interchanged in this multiplication, a different product is
obtained. This is readily verified numerically in Table 14; and it illustrates the
principle that if AB = C, then, in general, $BA \neq C$. Matrix multiplication is
not commutative. In matrix algebra it is essential to note the order of the
matrix factors because the order is not arbitrary, as in ordinary algebra,
where ab = ba.
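The non-commutative rule can be verified directly. The sketch below is an added illustration; it takes A from Table 11 and takes B to be the matrix that reproduces the product C of Table 14, and the helper name `matmul` is an assumption of this example.

```python
def matmul(a, b):
    """Row-by-column product of two matrices given as lists of rows."""
    return [[sum(a[i][j] * b[j][k] for j in range(len(b)))
             for k in range(len(b[0]))]
            for i in range(len(a))]

A = [[1, 2, 0], [0, 3, 1], [2, 2, 4]]
B = [[1, 0, 1], [0, 2, 1], [1, 2, 0]]

AB = matmul(A, B)   # B premultiplied by A
BA = matmul(B, A)   # A premultiplied by B
print(AB)
print(BA)
print(AB == BA)
```

The two products share the same factors but differ in nearly every cell, which is why the order of the factors must always be written explicitly.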

The following is an example of matrix algebra. If, instead of (7) and (11),
the transposed forms of these equations were used, we should have

(9) y'A' = x' ,
(13) z'B' = y' .

Substituting (13) in (9),

(14) z'B'A' = x' .
But

(15) AB = C ,
Hence

(16) B'A' = C' .
Substituting (16) in (14),

(17) z'C' = x' ,

which could also be written directly as the transposed form of (12).

In order that there shall be a unique solution for the simultaneous
equations (2), the matrix A of the coefficients must be non-singular, i.e., $|A| \neq 0$.
This may be tested by trying to solve a set of non-homogeneous simultaneous
equations with coefficients whose determinant does vanish.

The multiplication of matrices is associative. This is illustrated as follows:

(AB)C = A(BC) = ABC .

The matrix product (AB) may be determined and then postmultiplied by
C, or the matrix product (BC) may be determined and then premultiplied
by A. The product is the same. This principle can be extended to any
number of matrix factors. For example,

(ABC)D = (AB)(CD) = A(BCD) = ABCD .

Note that the order of the matrix factors is retained.

The sum, or difference, of two mXn matrices is the mXn matrix each of 
whose elements is the sum, or difference, of the corresponding elements in 
the given matrices. 



$$
\begin{bmatrix} 1 & 2 \\ 3 & 4 \end{bmatrix}
+
\begin{bmatrix} 2 & 3 \\ 4 & 5 \end{bmatrix}
=
\begin{bmatrix} 3 & 5 \\ 7 & 9 \end{bmatrix}
$$

The components may be written in any order.

A + B = B + A ; 

(A + B) + C = A + (B + C) = A+ B + C . 

If k and m are scalars, then 

kA + kB = k(A + B) ; 
kA + mA = (k + m)A .

The multiplication of matrices is distributive. 

A(B + C) = AB + AC ;

(B + C)A = BA + CA . 

It can be shown that the rank of a matrix product cannot exceed the lowest 
rank of any of the factors. Thus, if the ranks of matrices A, B, and C are
2, 4, and 3, respectively, then the rank of the matrix product ABC cannot 
exceed 2. 

It is sometimes useful to know that the determinant of the product of two 
square matrices is equal to the product of their determinants. The following is 
an example : 



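$$
\text{Let } |A| = \begin{vmatrix} 2 & 6 \\ 4 & 7 \end{vmatrix} = -10
\quad \text{and let} \quad
|B| = \begin{vmatrix} 2 & 1 \\ 3 & 4 \end{vmatrix} = +5 .
$$

Then

$$
|A| \cdot |B| = |AB| = \begin{vmatrix} 22 & 26 \\ 29 & 32 \end{vmatrix} = -50 .
$$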
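The example can be reproduced with a few lines of Python. This sketch is an added illustration for second-order matrices only; the helper names `det2` and `matmul2` are assumptions of this example.

```python
def det2(m):
    """Determinant of a 2x2 matrix."""
    return m[0][0] * m[1][1] - m[0][1] * m[1][0]

def matmul2(a, b):
    """Row-by-column product of two 2x2 matrices."""
    return [[sum(a[i][j] * b[j][k] for j in range(2)) for k in range(2)]
            for i in range(2)]

A = [[2, 6], [4, 7]]
B = [[2, 1], [3, 4]]
AB = matmul2(A, B)

print(det2(A), det2(B), det2(AB))
```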



In the matrix product AB, the matrix B is said to be premultiplied by the
matrix A, or the matrix A is said to be postmultiplied by the matrix B.

The operation of multiplying one matrix by another can be summarized
for mnemonic purposes in the diagram of Figure 1. This diagram shows
that rows of the first matrix are associated with columns of the second
matrix and that the middle subscript is eliminated in the product. If the
ith row of A is cross multiplied with the kth column of B, the cross product
is recorded in the cell ik of C.

There is nothing magical or profound in the particular rules of matrix
multiplication that have become conventional. The row-by-column rule is
entirely arbitrary. It would have been possible to set up a column-by-row
rule provided that the matrices had been so arranged that the rule would
have reproduced the original equations which the matrix notation represented.
It would also have been possible to have a notation which implied that
one matrix was on top of another, but this would not have been so convenient
for writing habits that go from left to right.

FIGURE 1 (diagram of the matrix product AB = C; not reproduced)

Diagonal matrices

In the manipulation of systems of equations there occurs frequently a
type of matrix in which all of the elements are zero except the diagonal
elements. A matrix in which only the elements of the principal diagonal are
non-vanishing is called a diagonal matrix. The following is a diagonal
matrix of order 4:

$$
\begin{bmatrix} a & 0 & 0 & 0 \\ 0 & b & 0 & 0 \\ 0 & 0 & c & 0 \\ 0 & 0 & 0 & d \end{bmatrix}
= D = \text{a diagonal matrix.}
$$



It sometimes happens that all of the elements of a diagonal matrix are 
identical. Such a matrix is called a scalar matrix. The following is an ex- 
ample: 

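$$
\begin{bmatrix} k & 0 & 0 & 0 \\ 0 & k & 0 & 0 \\ 0 & 0 & k & 0 \\ 0 & 0 & 0 & k \end{bmatrix}
= K = \text{a scalar matrix.}
$$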
When a diagonal matrix has unity in each diagonal cell, it is called a
unit matrix or the identity matrix. The following is an identity matrix of
order 4:

$$
\begin{bmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{bmatrix}
= I = \text{the identity matrix.}
$$



The properties of diagonal matrices are very useful in handling sets of 
linear equations: 

1) Premultiplication DA with a diagonal matrix D multiplies each row 
of A by the corresponding element in D. 

2) Postmultiplication AD with a diagonal matrix D multiplies each 
column of A by the corresponding element in D. 



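$$
\underset{D}{\begin{bmatrix} p & 0 \\ 0 & q \end{bmatrix}}
\cdot
\underset{A}{\begin{bmatrix} a & b \\ c & d \end{bmatrix}}
=
\underset{DA}{\begin{bmatrix} ap & bp \\ cq & dq \end{bmatrix}}
$$

$$
\underset{A}{\begin{bmatrix} a & b \\ c & d \end{bmatrix}}
\cdot
\underset{D}{\begin{bmatrix} p & 0 \\ 0 & q \end{bmatrix}}
=
\underset{AD}{\begin{bmatrix} ap & bq \\ cp & dq \end{bmatrix}}
$$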
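The two rules for diagonal matrices can be seen with numbers in place of letters. The sketch below is an added illustration; the numerical values of D and A are hypothetical, chosen only to make the row and column effects visible.

```python
def matmul(a, b):
    """Row-by-column product of two matrices given as lists of rows."""
    return [[sum(a[i][j] * b[j][k] for j in range(len(b)))
             for k in range(len(b[0]))]
            for i in range(len(a))]

D = [[2, 0], [0, 3]]   # diagonal matrix with p = 2, q = 3 (hypothetical values)
A = [[1, 2], [3, 4]]   # hypothetical matrix

DA = matmul(D, A)      # premultiplication: row 1 doubled, row 2 tripled
AD = matmul(A, D)      # postmultiplication: column 1 doubled, column 2 tripled
print(DA)
print(AD)
```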



3) Premultiplication or postmultiplication with a scalar matrix K
multiplies all elements of A by the constant element of K. This is a special case
of the first two theorems. The reason why premultiplication with a scalar
matrix has the same effect as postmultiplication is that if every row is
multiplied by a constant p, the effect is the same as if every column is multiplied
by the constant p. In either case every element of A is multiplied by p.

$$
\underset{A}{\begin{bmatrix} a & b \\ c & d \end{bmatrix}}
\cdot
\underset{K}{\begin{bmatrix} p & 0 \\ 0 & p \end{bmatrix}}
=
\underset{AK}{\begin{bmatrix} ap & bp \\ cp & dp \end{bmatrix}}
$$

$$
\underset{K}{\begin{bmatrix} p & 0 \\ 0 & p \end{bmatrix}}
\cdot
\underset{A}{\begin{bmatrix} a & b \\ c & d \end{bmatrix}}
=
\underset{KA}{\begin{bmatrix} ap & bp \\ cp & dp \end{bmatrix}}
$$

$$
p \cdot
\underset{A}{\begin{bmatrix} a & b \\ c & d \end{bmatrix}}
=
\underset{pA}{\begin{bmatrix} ap & bp \\ cp & dp \end{bmatrix}}
$$

Since the effect of a scalar matrix is independent of its position before or
after the other matrices in a matrix product, its constant element p can be
used in the product instead of the scalar matrix K as shown in the third
example. This illustrates the following theorem:

4) If K is a scalar matrix, then its constant element p may be substituted 
for the scalar matrix in a product. 

AK = KA = pA = Ap . 

A multiplier which is independent of the non-commutative rule of matrix 
algebra is called a scalar. 

5) To multiply a matrix A by a scalar p, in either order, pA or Ap, is to 
multiply each element of A by p. 

The identity matrix is a special case of the scalar matrix, and hence it is 
also independent of the non-commutative rule of matrix multiplication. 



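$$
\underset{I}{\begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}}
\cdot
\underset{A}{\begin{bmatrix} a & b \\ c & d \end{bmatrix}}
=
\underset{IA = A}{\begin{bmatrix} a & b \\ c & d \end{bmatrix}}
$$

$$
\underset{A}{\begin{bmatrix} a & b \\ c & d \end{bmatrix}}
\cdot
\underset{I}{\begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}}
=
\underset{AI = A}{\begin{bmatrix} a & b \\ c & d \end{bmatrix}}
$$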



6) To multiply a matrix A by the identity matrix, in either order, AI or 
I A, is to reproduce the matrix A unaltered. 

AI = I A = A . 

The identity matrix I in matrix algebra corresponds to unity in ordinary 
algebra. Hence the identity matrix is suppressed, just as unity is suppressed 
in ordinary algebra. 

1X5 = 5, 

1 X x = x, 
I A = A. 

The inverse 

If a matrix A is non-singular, i.e., $|A| \neq 0$, then there exists another
unique matrix such that its multiplication by A produces the identity
matrix. This other matrix is called the inverse of A, and it is denoted $A^{-1}$.
Hence, if A is non-singular,

$$
AA^{-1} = A^{-1}A = I .
$$

The inverse of $A^{-1}$ is A, so that

$$
(A^{-1})^{-1} = A .
$$



Consider the ordinary algebraic equation,

ax = y .

If it is desired to state x explicitly, the equation is ordinarily written as

$$
x = \frac{1}{a}\, y .
$$

If the given matrix equation is

AB = C

and if it is desired to write it explicitly for B, this cannot be accomplished
by ordinary division. A matrix is not a number but a rectangular table of
numbers. There is an operation in matrix algebra which corresponds to
division in ordinary algebra. If both members of the matrix equation AB = C
are premultiplied by $A^{-1}$, we have

$$
A^{-1}AB = A^{-1}C .
$$

But

$$
A^{-1}A = I .
$$

Hence

$$
IB = A^{-1}C ,
$$

or

$$
B = A^{-1}C .
$$

This is the desired form. This example illustrates the operation in matrix 
algebra which corresponds to division in ordinary algebra. It consists in 
moving a premultiplying or postmultiplying factor from one member of the 
equation to the other member in the same relative position. This operation 
is illustrated in the following examples: 
If

ABC = M ,

then

$$
\begin{aligned}
BC &= A^{-1}M , \\
C &= B^{-1}A^{-1}M , \\
CM^{-1} &= B^{-1}A^{-1} , \\
CM^{-1}A &= B^{-1} , \\
M^{-1}A &= C^{-1}B^{-1} .
\end{aligned}
$$

Since this operation is analogous to ordinary division, the inverse of A is
sometimes called the reciprocal of A.

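The inverse of any product of matrices is the product of their inverses in
reverse order.

$$
\begin{aligned}
\text{Let } ABC &= M , \\
BC &= A^{-1}M , \\
C &= B^{-1}A^{-1}M , \\
I &= C^{-1}B^{-1}A^{-1}M , \\
M^{-1} &= C^{-1}B^{-1}A^{-1} .
\end{aligned}
$$

But

$$
(ABC)^{-1} = M^{-1} .
$$

Hence

$$
(ABC)^{-1} = C^{-1}B^{-1}A^{-1} .
$$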

A method of writing the inverse of a given matrix is as follows: Let the
given matrix be A with elements $a_{ij}$.

1) Write the matrix M with elements $m_{ij}$ which are the first minors of
the elements $a_{ij}$;

2) Reverse the signs of alternate elements of M so that it becomes the
matrix E with elements $e_{ij} = (-1)^{i+j} m_{ij}$;

3) Write the transpose of E, namely, E' = F, with elements $f_{ij} = e_{ji}$. The
matrix F is the adjoint of A;

4) Divide each element of F by the value of the determinant |A|. This
is the inverse $A^{-1}$ with elements

$$
\frac{f_{ij}}{|A|} .
$$



The writing of an inverse will be illustrated by the numerical example of 
equations (2). The given matrix equation is 

Ay = x . 
It is desired to find the inverse of A so that the equation 

$y = A^{-1}x$
may be written in numerical form. 
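$$
A = \begin{bmatrix} 1 & 2 & 0 \\ 0 & 3 & 1 \\ 2 & 2 & 4 \end{bmatrix} ,
\qquad |A| = +14 ,
$$

$$
M = \begin{bmatrix} 10 & -2 & -6 \\ 8 & 4 & -2 \\ 2 & 1 & 3 \end{bmatrix} ,
\qquad
E = \begin{bmatrix} 10 & 2 & -6 \\ -8 & 4 & 2 \\ 2 & -1 & 3 \end{bmatrix} ,
\qquad
F = E' = \begin{bmatrix} 10 & -8 & 2 \\ 2 & 4 & -1 \\ -6 & 2 & 3 \end{bmatrix} ,
$$

$$
A^{-1} = \begin{bmatrix}
\tfrac{5}{7} & -\tfrac{4}{7} & \tfrac{1}{7} \\
\tfrac{1}{7} & \tfrac{2}{7} & -\tfrac{1}{14} \\
-\tfrac{3}{7} & \tfrac{1}{7} & \tfrac{3}{14}
\end{bmatrix} .
$$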



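It is of interest to verify numerically the matrix equation $y = A^{-1}x$. It is
written in rectangular notation in Table 16.

Table 16

$$
\underset{A^{-1}}{\begin{bmatrix}
\tfrac{5}{7} & -\tfrac{4}{7} & \tfrac{1}{7} \\
\tfrac{1}{7} & \tfrac{2}{7} & -\tfrac{1}{14} \\
-\tfrac{3}{7} & \tfrac{1}{7} & \tfrac{3}{14}
\end{bmatrix}}
\cdot
\underset{x}{\begin{bmatrix} 1 \\ 3 \\ 2 \end{bmatrix}}
=
\underset{y}{\begin{bmatrix} -\tfrac{5}{7} \\ \tfrac{6}{7} \\ \tfrac{3}{7} \end{bmatrix}}
$$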
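The four steps for writing an inverse can be followed literally in code. The sketch below is an added illustration under the assumption that the matrix has integer elements; the helper names `det`, `minor`, and `inverse` are chosen for this example, and the expansion by minors is only practical for small matrices.

```python
from fractions import Fraction

def minor(m, i, j):
    """Matrix that remains after deleting row i and column j."""
    return [row[:j] + row[j + 1:] for k, row in enumerate(m) if k != i]

def det(m):
    """Determinant by expansion along the first row."""
    if not m:
        return 1  # convention for the empty matrix, needed for 1x1 cofactors
    return sum((-1) ** j * m[0][j] * det(minor(m, 0, j)) for j in range(len(m)))

def inverse(a):
    """Inverse by the adjoint method, for a matrix of integers."""
    n = len(a)
    d = det(a)
    if d == 0:
        raise ValueError("matrix is singular; no inverse exists")
    # Steps 1 and 2: first minors with the signs of alternate elements reversed.
    e = [[(-1) ** (i + j) * det(minor(a, i, j)) for j in range(n)] for i in range(n)]
    # Step 3: transpose E to obtain the adjoint F. Step 4: divide by |A|.
    return [[Fraction(e[j][i], d) for j in range(n)] for i in range(n)]

A = [[1, 2, 0], [0, 3, 1], [2, 2, 4]]
x = [1, 3, 2]
A_inv = inverse(A)
y = [sum(A_inv[i][j] * x[j] for j in range(3)) for i in range(3)]
print(A_inv)
print(y)
```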



The characteristic equation 

The characteristic equation is of considerable theoretical interest in fac- 
tor analysis, and it appears in several of the fundamental factor problems. 
For this reason it is described in this introduction. In a more complete 
didactic presentation of this subject, the characteristic equation should be 
introduced with some geometric and other interpretation, so that the
significance of this equation might be apparent. The relation of the
characteristic equation to the problems of factor theory will appear in later
chapters.

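If a constant $\beta$ is added explicitly to each diagonal element of a square
matrix A, the resulting matrix is called the characteristic matrix of A. It is
illustrated as follows:

$$
\begin{bmatrix} a_{11} & a_{12} & a_{13} \\ a_{21} & a_{22} & a_{23} \\ a_{31} & a_{32} & a_{33} \end{bmatrix}
+ \beta
\begin{bmatrix} 1 & 0 & 0 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{bmatrix}
=
\begin{bmatrix} (a_{11}+\beta) & a_{12} & a_{13} \\ a_{21} & (a_{22}+\beta) & a_{23} \\ a_{31} & a_{32} & (a_{33}+\beta) \end{bmatrix}
$$

$$
(A + \beta I) = \text{characteristic matrix of } A .
$$

The determinant of the characteristic matrix is the characteristic
determinant of A.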

The expansion of a characteristic determinant of order r is a polynomial
of degree r. When this polynomial is set equal to zero, the equation is called
the characteristic equation of A. An example is the following equation:

$$
\begin{vmatrix} (1+\beta) & 0 & 1 \\ 1 & (2+\beta) & 1 \\ 1 & 0 & (2+\beta) \end{vmatrix} = 0 .
$$

When the determinant is expanded, the characteristic equation becomes

$$
\beta^3 + 5\beta^2 + 7\beta + 2 = 0 .
$$

The coefficients of the expansion of a characteristic determinant can be
written in terms of the principal minors without expanding the whole
determinant. Let the characteristic equation be as follows:

$$
\beta^r + m_1\beta^{r-1} + m_2\beta^{r-2} + \cdots + m_r = 0 .
\tag{18}
$$

Then the coefficient $m_x$ is the sum of all the x-rowed principal minors in A.
The coefficient $m_1$ is the sum of all the 1-rowed principal minors of A. These
are 1, 2, 2, and the sum is +5. The coefficient $m_2$ is the sum of all the
2-rowed principal minors of A. These are

$$
\begin{vmatrix} 1 & 0 \\ 1 & 2 \end{vmatrix} = +2 ; \qquad
\begin{vmatrix} 1 & 1 \\ 1 & 2 \end{vmatrix} = +1 ; \qquad
\begin{vmatrix} 2 & 1 \\ 0 & 2 \end{vmatrix} = +4 .
$$

The coefficient $m_2 = 2 + 1 + 4 = +7$. The coefficient $m_3$ is the sum of all
principal minors of order 3. This is the determinant A itself and its value is
+2. The coefficient of the highest power of $\beta$ is unity. These coefficients
can be verified by expanding the determinant.
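The rule for the coefficients can be checked by summing principal minors directly. The sketch below is an added illustration; the matrix A is taken to be the one whose principal minors match the values quoted in the text, and the helper names are assumptions of this example.

```python
from itertools import combinations

def det(m):
    """Determinant by expansion along the first row (small matrices only)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j in range(len(m)):
        sub = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * m[0][j] * det(sub)
    return total

def principal_minor_sums(a):
    """Return [m_1, ..., m_r]: m_x is the sum of all x-rowed principal minors."""
    n = len(a)
    sums = []
    for order in range(1, n + 1):
        total = 0
        for idx in combinations(range(n), order):
            # A principal minor keeps the same set of rows and columns.
            total += det([[a[i][j] for j in idx] for i in idx])
        sums.append(total)
    return sums

A = [[1, 0, 1], [1, 2, 1], [1, 0, 2]]
coefficients = principal_minor_sums(A)
print(coefficients)

# Spot check at beta = 1: the characteristic determinant must equal the polynomial.
beta = 1
shifted = [[A[i][j] + (beta if i == j else 0) for j in range(3)] for i in range(3)]
polynomial = beta ** 3 + sum(m * beta ** (3 - k) for k, m in enumerate(coefficients, 1))
```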

The summational notation 

If a set of n numbers is to be summed, the operation may be indicated in 
the expanded form, 

$$
x_1 + x_2 + x_3 + \cdots + x_n = y .
\tag{19}
$$

The same operation may be indicated in the more condensed summational
form, namely,

$$
\sum_{i=1}^{n} x_i = y ,
\tag{20}
$$



in which the subscript i takes the successive integral values from 1 to n, 
inclusive. 

In statistical work it is pedantic to indicate the limits, because the sum- 
mation is over the entire population except in rare cases, which can be 
specially indicated. It is acceptable practice in statistical work to write $\Sigma x$
without subscripts when the usual form of summation over the population 
is implied. In factor analysis this simple and convenient notation becomes 
ambiguous because summation may be over the factors, over the popula- 
tion, or over the variables. It is therefore advisable to adopt the unam- 
biguous double-subscript notation that is conventional in mathematics. 

As an example, the sum of the elements in the first row of Table 2 can be
written in the form

$$
a_{11} + a_{12} + a_{13} + \cdots + a_{1n} = \sum_{j=1}^{n} a_{1j} .
$$

Here it is the second subscript j, representing the columns, which is found
in the summation sign. This means that the a's are to be summed for all
values of j from 1 to n. The first subscript is fixed. The summation is thereby
confined to one row.

The notation can be generalized to represent any row i. It then takes
the form

$$
a_{i1} + a_{i2} + a_{i3} + \cdots + a_{in} = \sum_{j=1}^{n} a_{ij} .
$$

This notation means that the a's are to be summed for a fixed value of i,
since i does not occur in the summation. The a's are to be summed in one
row i for all the column values of j from 1 to n.

By analogy, the sum of all the elements in a column j of Table 2 may be
represented as follows:

$$
a_{1j} + a_{2j} + a_{3j} + \cdots + a_{mj} = \sum_{i=1}^{m} a_{ij} .
$$

Here it is the column j that is fixed because it does not occur in the
summation sign. The a's are to be summed in some specified column j for all values
of i from 1 to m.

If it is desired to designate the sum of all the elements in the matrix, each
of the m row-sums must be summed. This involves a summation over both
i and j. We then have

$$
\text{Sum of all elements in } A = \sum_{i=1}^{m} \sum_{j=1}^{n} a_{ij} .
$$

Since it does not matter whether the rows are summed before the columns,
or vice versa, we have

$$
\sum_{i=1}^{m} \sum_{j=1}^{n} a_{ij} = \sum_{j=1}^{n} \sum_{i=1}^{m} a_{ij} .
$$



A matrix multiplication can also be designated by the summational
notation. Consider the matrix multiplication AB = C of Table 17.

Table 17

$$
\underset{A}{\begin{bmatrix}
a_{11} & a_{12} & a_{13} \\
a_{21} & a_{22} & a_{23} \\
a_{31} & a_{32} & a_{33} \\
a_{41} & a_{42} & a_{43}
\end{bmatrix}}
\cdot
\underset{B}{\begin{bmatrix}
b_{11} & b_{12} & b_{13} \\
b_{21} & b_{22} & b_{23} \\
b_{31} & b_{32} & b_{33}
\end{bmatrix}}
=
\underset{AB = C}{\begin{bmatrix}
c_{11} & c_{12} & c_{13} \\
c_{21} & c_{22} & c_{23} \\
c_{31} & c_{32} & c_{33} \\
c_{41} & c_{42} & c_{43}
\end{bmatrix}}
$$

Let A be a matrix of order mXn with general element $a_{ij}$. Let B be a
matrix of order nXp with general element $b_{jk}$. Then the product AB = C
must be a matrix of order mXp with the general element $c_{ik}$. The middle
subscript j for the general element disappears in the product, and so does
the middle dimension n in the product (mXn)(nXp) = (mXp). In the example,
m = 4, n = 3, p = 3.
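The element $c_{11}$ is obtained by the cross multiplication of the first row of
A and the first column of B. In summational notation,

$$
a_{11}b_{11} + a_{12}b_{21} + a_{13}b_{31} = \sum_{j=1}^{n} a_{1j}b_{j1} = c_{11} .
\tag{21}
$$

The second row of A and the first column of B:

$$
a_{21}b_{11} + a_{22}b_{21} + a_{23}b_{31} = \sum_{j=1}^{n} a_{2j}b_{j1} = c_{21} .
\tag{22}
$$

The ith row of A and the first column of B:

$$
a_{i1}b_{11} + a_{i2}b_{21} + a_{i3}b_{31} = \sum_{j=1}^{n} a_{ij}b_{j1} = c_{i1} .
\tag{23}
$$

The ith row of A and the kth column of B:

$$
a_{i1}b_{1k} + a_{i2}b_{2k} + a_{i3}b_{3k} = \sum_{j=1}^{n} a_{ij}b_{jk} = c_{ik} .
\tag{24}
$$

The summation of (24) gives a single element in C. If it is desired to
indicate the sum of all the elements in the ith row of C, the summation will be
over k with a fixed value for i. We have then

$$
\sum_{k=1}^{p} \sum_{j=1}^{n} a_{ij}b_{jk} = \sum_{k=1}^{p} c_{ik} .
\tag{25}
$$

Finally, if the sum of all the elements in C is to be indicated, the summation
must also be made for all rows. Then

$$
\sum_{i=1}^{m} \sum_{k=1}^{p} \sum_{j=1}^{n} a_{ij}b_{jk} = \sum_{i=1}^{m} \sum_{k=1}^{p} c_{ik} .
\tag{26}
$$

The summation of (24) represents the multiplication of two matrices, $a_{ij}$
and $b_{jk}$, whose product is the matrix $c_{ik}$. It can be visualized in Figure 1.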
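The nested summations of (24), (25), and (26) translate directly into nested loops. The sketch below is an added illustration; the numerical matrices are hypothetical, chosen only to give the orders m = 4, n = 3, p = 3 of Table 17.

```python
# Hypothetical matrices with the orders of Table 17: A is 4 x 3, B is 3 x 3.
A = [[1, 2, 0], [0, 3, 1], [2, 2, 4], [1, 0, 1]]
B = [[1, 0, 1], [0, 2, 1], [1, 2, 0]]
m, n, p = 4, 3, 3

# Equation (24): each element c_ik is a sum over the middle subscript j.
C = [[sum(A[i][j] * B[j][k] for j in range(n)) for k in range(p)]
     for i in range(m)]

# Equation (25): sum over k for a fixed row i.
row_sums = [sum(C[i][k] for k in range(p)) for i in range(m)]

# Equation (26): sum over all rows as well; the order of summation is arbitrary.
total_rows_first = sum(row_sums)
total_columns_first = sum(sum(C[i][k] for i in range(m)) for k in range(p))
print(C, row_sums, total_rows_first)
```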
As an example of manipulation with the summational notation, consider
the first term of equation (12-i). It is

$$
\frac{1}{N} \sum_{i=1}^{N} \sum_{m=1}^{r} a_{jm}^2 x_{mi}^2 ,
$$

in which it is desired to substitute

$$
\frac{1}{N} \sum_{i=1}^{N} x_{mi}^2 = 1
$$

in order to simplify the first term.

Since the order of summation is arbitrary, the order of the summations
may be interchanged. Then the first term becomes

$$
\frac{1}{N} \sum_{m=1}^{r} \sum_{i=1}^{N} a_{jm}^2 x_{mi}^2 .
$$

Since the subscript i does not occur in $a_{jm}^2$, this factor is a constant during
the summation over i. Hence it may be placed in front of the summation
over i without altering the value of the first term, which then becomes

$$
\frac{1}{N} \sum_{m=1}^{r} a_{jm}^2 \sum_{i=1}^{N} x_{mi}^2 .
$$

But the reciprocal of N is a scalar, and so it can be placed anywhere in the
summation. Changing its relative position, the first term becomes

$$
\sum_{m=1}^{r} a_{jm}^2 \, \frac{1}{N} \sum_{i=1}^{N} x_{mi}^2 ,
$$

and now the substitution can be made more clearly. Suppressing the part
which is equal to unity, the first term simplifies into

$$
\sum_{m=1}^{r} a_{jm}^2
$$

as it occurs in equation (18-i). These steps are more explicit than will
ordinarily be found necessary, but they illustrate further the manner in which
the summational notation can be handled.
Linear dependence

A matrix of order mXn may be regarded as m sets of numbers with n
numbers in each set. Each row of numbers is then a set. Table 18 is a matrix
of order 4X6.

Table 18

$$
\begin{bmatrix}
2 & 3 & 5 & 1 & 4 & 2 \\
4 & 6 & 10 & 2 & 8 & 4 \\
1 & 1\tfrac{1}{2} & 2\tfrac{1}{2} & \tfrac{1}{2} & 2 & 1 \\
6 & 9 & 15 & 3 & 12 & 6
\end{bmatrix}
$$

In this table every row can be expressed linearly in terms of
the first row. By this is meant that for any row i there exists a constant $c_i$
such that

$$
a_{ij} = c_i a_{1j} .
\tag{27}
$$

For the fourth row the constant $c_4$ is 3, so that each element in the fourth
row is three times the corresponding element in the first row. When any
row can be so expressed in terms of the first row, the rows are proportional,
and it can be shown that the columns are then also proportional. If two
rows are not proportional, they are said to be linearly independent, for one
of them cannot be expressed linearly in terms of the other. When each row
can be expressed linearly in terms of one row, the rank of the matrix is 1.
This means that all second-order minors vanish. This fact is readily verified
in Table 18.

The idea of proportionality can be generalized to two or more dimensions.
An example of rank 2 is shown in Table 19.

Table 19

$$
\begin{bmatrix}
7 & 5 & 4 & 6 & 7 & 2 \\
2 & 2 & 0 & 4 & 2 & 4 \\
5 & 4 & 2 & 6 & 5 & 4 \\
9 & 7 & 4 & 10 & 9 & 6
\end{bmatrix}
$$

In this matrix any row can be expressed as a linear function of any two rows.
In this particular matrix there are no two dependent rows. This requires that
two constants $c_1$ and $c_2$ (not both zero) exist such that

$$
a_{ij} = c_1 a_{1j} + c_2 a_{2j} ,
\tag{28}
$$

where the elements in the ith row are expressed linearly in terms of the
first two rows.
In the following example the two constants for the third row of Table 19
are determined. For the first two entries in the third row,

$$
\begin{cases}
5 = 7c_1 + 2c_2 , \\
4 = 5c_1 + 2c_2 .
\end{cases}
\tag{29}
$$

Solving (29) simultaneously, we find that $c_1 = 1/2$ and $c_2 = 3/4$. Testing this
on the last column, as an example,

$$
(2)(\tfrac{1}{2}) + (4)(\tfrac{3}{4}) = 4 .
$$

A different set of constants must be determined for each successive
independent row.
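The determination of the constants can be repeated for every column and for the fourth row as well. The sketch below is an added illustration; it solves the pair of equations (29) by Cramer's rule on the first two columns, and the helper names `constants` and `is_combination` are assumptions of this example.

```python
from fractions import Fraction

table_19 = [
    [7, 5, 4, 6, 7, 2],
    [2, 2, 0, 4, 2, 4],
    [5, 4, 2, 6, 5, 4],
    [9, 7, 4, 10, 9, 6],
]

def constants(row, base1, base2):
    """Solve row = c1*base1 + c2*base2 using the first two columns only."""
    d = base1[0] * base2[1] - base1[1] * base2[0]
    c1 = Fraction(row[0] * base2[1] - row[1] * base2[0], d)
    c2 = Fraction(base1[0] * row[1] - base1[1] * row[0], d)
    return c1, c2

def is_combination(row, base1, base2):
    """Check that the constants found reproduce every element of the row."""
    c1, c2 = constants(row, base1, base2)
    return all(c1 * p + c2 * q == r for p, q, r in zip(base1, base2, row))

first, second, third, fourth = table_19
c_third = constants(third, first, second)
c_fourth = constants(fourth, first, second)
print(c_third, c_fourth)
```

Because the constants found from two columns reproduce all six columns of both rows, the third and fourth rows are linear functions of the first two.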

Since each of the rows in Table 19 can be expressed linearly in terms of 
the first two rows, it can be shown that the matrix of Table 19 is of rank 2. 
This implies that all third-order minors vanish. As an example, the
following third-order minor of Table 19 vanishes.

$$
\begin{vmatrix} 5 & 6 & 7 \\ 4 & 6 & 5 \\ 7 & 10 & 9 \end{vmatrix} = 0 .
$$

It can be shown that if the rank of a matrix is r, then there exists a set 
of r columns, or rows, in terms of which each column, or row, can be linearly 
expressed. 

Geometric interpretations 

The most frequent form of equation for a straight line in a plane is prob- 
ably 

(30) y = mx + p , 

in which x and y are the two variables while m and p are two independent 
parameters. This agrees with the well-known fact that any two points de- 
termine a straight line. The multiplying constant m is the slope, and the 
additive constant p is the y-intercept. In the present context it will be more
useful to begin with the equation of a straight line in the more general form

$$
a_1x_1 + a_2x_2 + k = 0 .
\tag{31}
$$

This equation has two variables, $x_1$ and $x_2$, and three parameters, $a_1$, $a_2$, $k$.
Since only two points are needed to determine the line, it follows that the
three parameters are not independent.
Equation (31) can evidently be multiplied by any arbitrary constant
without affecting its geometrical representation in the plane. Then (31)
becomes

$$
ca_1x_1 + ca_2x_2 + ck = 0 .
\tag{32}
$$

Let the multiplier c be so chosen that the sum of the squares of the coefficients
of the variables is equal to unity. Then

$$
(ca_1)^2 + (ca_2)^2 = 1 ,
\tag{33}
$$

or

$$
c = \frac{1}{\sqrt{a_1^2 + a_2^2}} .
\tag{34}
$$

Let $ca_1 = \lambda_1$; $ca_2 = \lambda_2$; $ck = d$. Then

$$
\lambda_1x_1 + \lambda_2x_2 + d = 0 ,
\tag{35}
$$

where

$$
\lambda_1^2 + \lambda_2^2 = 1 .
\tag{36}
$$

When the equation is written with this adjustment, it is said to be in
normal form. This definition is applied not only to the equation of a line
in a plane, and to the equation of a plane in a space of three dimensions,
but also to the equation of a hyperplane of (n-1) dimensions in a space of
n dimensions. The number of dimensions of the space defined by equation
(35) is (n-1) where n is the number of variables. Hence equation (35)
defines a space of one dimension, a line, in a space of n = 2 dimensions, a plane.

When a linear equation is in the normal form, the parameters have
interesting meaning. The parameters $\lambda_1$ and $\lambda_2$ are the direction cosines of the
normal to the linear space which is defined by equation (35); and the
parameter d is the distance from the origin to the same linear space. The
normal to the line makes the angle $\cos^{-1}\lambda_1$ with the $x_1$-axis and the angle
$\cos^{-1}\lambda_2$ with the $x_2$-axis. The direction cosines of a space are the cosines of
the angles that its normal makes with the Cartesian co-ordinate axes. In the
present case the space is a one-dimensional space, namely, that which is
defined by equation (35). In order to avoid ambiguity, the normal is taken
positive on the side which contains the origin. Equation (35) may be interpreted
geometrically as defining a space of one dimension whose normal has the
direction cosines $\lambda_1$ and $\lambda_2$ and which is distant d from the origin.
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If the parameter d vanishes, then the space which is defined by the equa- 
tion contains the origin. In equation (35) the line contains the origin if d 
is zero. 

Equation (35) locates a one-dimensional space (a line) in a two-dimensional 
space (a plane). If a new variable, $x_3$, is added, the equation takes 
the form 

(37) $\lambda_1 x_1 + \lambda_2 x_2 + \lambda_3 x_3 + d = 0$ . 

It defines a space of two dimensions (a plane) in a space of three dimensions 
with three orthogonal axes. Here, as before, if the equation is in normal 
form, then d is the distance of the plane from the origin, and the three 
coefficients $\lambda_1$, $\lambda_2$, $\lambda_3$ are the direction cosines of the normal to the plane. 
They are the cosines of the angles that the normal makes with intersecting 
lines that are parallel to the $x_1$, $x_2$, and $x_3$ axes, respectively. 

The direction cosines have the property that the sum of their squares is 
unity. In equation (35) the line is defined if the parameter d and one of the 
direction cosines are given. In equation (37) the plane is defined by its dis- 
tance from the origin and any two of its direction cosines. The third direc- 
tion cosine can be found from the fact that the sum of the squares of the 
direction cosines equals unity. If d vanishes in (37), the plane contains the 
origin. A plane through the origin is therefore defined by its direction co- 
sines, which are the direction cosines of its normal. 

An equation of the same form in n variables is 

(38) $\lambda_1 x_1 + \lambda_2 x_2 + \lambda_3 x_3 + \dots + \lambda_n x_n + d = 0$ . 

It defines a hyperplane of $(n-1)$ dimensions in a space of n dimensions. If 
(38) is in normal form, d is the distance of the hyperplane from the origin. 
The $\lambda$'s are the direction cosines of the normal to the hyperplane and hence 

(39) $\sum_{j=1}^{n} \lambda_j^2 = +1$ . 

In the present factor analysis the hyperplanes of primary interest contain 
the origin, so that the parameter d vanishes. Then 

(40) $\lambda_1 x_1 + \lambda_2 x_2 + \lambda_3 x_3 + \dots + \lambda_n x_n = 0$ . 

The n values of $\lambda_j$ are said to be the direction cosines of the hyperplane L 
which is defined by its normal $\Lambda$. 

A matrix may be given a geometric interpretation. Let the matrix A be 
of order $m \times n$. Then the n elements of each row may be regarded as the 
Cartesian co-ordinates of a point in n dimensions. Since there are m rows, 
the matrix may be thought of as defining the positions of m points, one for 
each row, in a space of n dimensions. Table 20 is of order $6 \times 3$. It can 
therefore be regarded as defining the positions of six points in a space of 
three dimensions. 

Table 20 

   1    2   -4 
  -4    1  -11 
  -2   -3    5 
   3    5   -9 
   4   -2   14 
   0    1   -3 

Let the rank of an $m \times n$ matrix A be r. Then it can be shown that the m 
points are contained in a space of r dimensions which also contains the 
origin. The rank of the matrix of Table 20 is 2. Hence the six points should 
lie in a plane which contains the origin. The equation of such a plane is 

(41) $\lambda_1 x_1 + \lambda_2 x_2 + \lambda_3 x_3 = 0$ , 

in which the x's are the three co-ordinates of each of the points in the plane 
and the $\lambda$'s are the direction cosines of the plane. The $\lambda$'s are not 
independent parameters because of the conditional equation (39). Hence any two 
$\lambda$'s define the plane. These may be found by any two of the six points which are 
not collinear with the origin. Since no two rows of Table 20 are proportional, 
no two of the points are collinear with the origin. If two such points 
were found, then these two points would define a line through the origin and 
not a plane. 
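
The statement that the matrix of Table 20 is of rank 2 can be checked by elimination. The sketch below is not part of the original exposition; the name `matrix_rank` is hypothetical, exact fractions are used so that no rounding tolerance is needed, and the sixth row is read as (0, 1, -3), which is an assumption where the printed row is incomplete.

```python
from fractions import Fraction

TABLE_20 = [
    [1, 2, -4],
    [-4, 1, -11],
    [-2, -3, 5],
    [3, 5, -9],
    [4, -2, 14],
    [0, 1, -3],  # leading 0 is assumed; the printed row is incomplete
]


def matrix_rank(rows):
    """Rank by Gaussian elimination in exact rational arithmetic."""
    m = [[Fraction(v) for v in row] for row in rows]
    rank = 0
    for col in range(len(m[0])):
        # Find a row at or below the current pivot row with a non-zero entry.
        pivot = next((r for r in range(rank, len(m)) if m[r][col] != 0), None)
        if pivot is None:
            continue
        m[rank], m[pivot] = m[pivot], m[rank]
        for r in range(rank + 1, len(m)):
            factor = m[r][col] / m[rank][col]
            m[r] = [x - factor * y for x, y in zip(m[r], m[rank])]
        rank += 1
    return rank


print(matrix_rank(TABLE_20))  # 2
```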

Substituting the x's of the first two points of Table 20 in (41), 



(42) $\lambda_1 + 2\lambda_2 - 4\lambda_3 = 0$ , 
     $-4\lambda_1 + \lambda_2 - 11\lambda_3 = 0$ . 

Solving for $\lambda_1$ and $\lambda_2$ in terms of $\lambda_3$, we have 

(43) $\lambda_1 = -2\lambda_3$ , 
     $\lambda_2 = 3\lambda_3$ . 

Substituting (43) in (39), 

(44) $4\lambda_3^2 + 9\lambda_3^2 + \lambda_3^2 = 1$ , 

or 

$\lambda_3 = \dfrac{1}{\sqrt{14}}$ , 

and hence 

$\lambda_1 = -\dfrac{2}{\sqrt{14}}$ , $\lambda_2 = \dfrac{3}{\sqrt{14}}$ . 

The equation of the plane in normal form is therefore 

(45) $-\dfrac{2}{\sqrt{14}}\, x_1 + \dfrac{3}{\sqrt{14}}\, x_2 + \dfrac{1}{\sqrt{14}}\, x_3 = 0$ . 

All of the four remaining points must lie in this plane, since the rank of 
Table 20 is 2. The three coefficients are the direction cosines of the plane, 
i.e., the direction cosines of the normal to the plane. 
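
The worked example can be verified numerically. The sketch below does not follow the substitution used in the text; it obtains the normal as the vector product of the first two rows, which is a swapped-in technique that gives the same direction in three dimensions. The sixth row is again read as (0, 1, -3), an assumption, and the sign of the normal produced by the vector product is arbitrary in general although it happens to agree with the text here.

```python
import math

TABLE_20 = [
    [1, 2, -4],
    [-4, 1, -11],
    [-2, -3, 5],
    [3, 5, -9],
    [4, -2, 14],
    [0, 1, -3],  # leading 0 is assumed; the printed row is incomplete
]


def cross(u, v):
    """Vector product of two three-dimensional vectors."""
    return [
        u[1] * v[2] - u[2] * v[1],
        u[2] * v[0] - u[0] * v[2],
        u[0] * v[1] - u[1] * v[0],
    ]


normal = cross(TABLE_20[0], TABLE_20[1])           # [-18, 27, 9]
length = math.sqrt(sum(c * c for c in normal))
cosines = [c / length for c in normal]             # direction cosines

# Left-hand side of the plane equation for each of the six points.
residuals = [sum(l * x for l, x in zip(cosines, row)) for row in TABLE_20]

print([round(c, 4) for c in cosines])  # [-0.5345, 0.8018, 0.2673]
print(all(abs(r) < 1e-12 for r in residuals))  # True
```

The three printed values agree with -2, 3, and 1 divided by the square root of 14.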

The distance of a point from the origin is $\sqrt{\sum x^2}$, where the x's are its 
co-ordinates. For example, the distance of the fourth point from the origin 
is $\sqrt{3^2 + 5^2 + (-9)^2} = 10.72$. If the sum of the squares of the co-ordinates of 
a point is equal to unity, then the point is at unit distance from the origin. 

Each point may be interpreted as defining the terminus of a vector from 
the origin. The scalar product of any two vectors is $h_i h_l \cos\phi$, where $h_i$ and 
$h_l$ are the lengths of the vectors and $\phi$ is their angular separation. If the two 
vectors are of unit length, then the scalar product is the cosine of the angular 
separation. It can be shown that the scalar product of two vectors can be 
expressed in the form 

$h_i h_l \cos\phi = \sum_{j=1}^{n} x_{ij} x_{lj}$ , 

where i and l refer to points (rows) and j refers to co-ordinates (columns). 
For example, the scalar product of the vectors defined by the second and 
fourth points of Table 20 is 

$(-4)(3) + (1)(5) + (-11)(-9) = +92$ . 

If the sum of the squares of the co-ordinates of each point is equal to unity, 
so that the points lie at unit distance from the origin, then the scalar 
product, or cross product, is the cosine of the angular separation of the vectors 
at the origin. 
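
The two numerical examples above can be reproduced in a few lines. This is an illustrative sketch; the function names `length` and `scalar_product` are hypothetical, and the final step, which divides by the two lengths to obtain the cosine of the angular separation, is added here only to show how the formula is used for vectors that are not of unit length.

```python
import math


def length(p):
    """Distance of the point p from the origin."""
    return math.sqrt(sum(x * x for x in p))


def scalar_product(p, q):
    """Sum of the products of corresponding co-ordinates."""
    return sum(x * y for x, y in zip(p, q))


second = [-4, 1, -11]  # second point of Table 20
fourth = [3, 5, -9]    # fourth point of Table 20

print(round(length(fourth), 2))        # 10.72
print(scalar_product(second, fourth))  # 92

# Cosine of the angular separation of the two vectors at the origin.
cos_phi = scalar_product(second, fourth) / (length(second) * length(fourth))
```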

Orthogonal transformations 

The three simultaneous equations (2) may be regarded as representing 
the three co-ordinates of a point x $(x_1, x_2, x_3)$, the three co-ordinates of a 
point y $(y_1, y_2, y_3)$, and the law by which each point y is transformed into a 
corresponding point x. For every point y, there exists some other point x 
whose co-ordinates can be found by (2) when the co-ordinates of y are 
known. This relation is called a linear transformation by which the points y 
are moved to the corresponding points x. Every pair of the corresponding 
points are related by the linear transformation (2). The transformation is 
called linear when the equations by which the x's can be found from the 
y's are of the first degree, as is the case in (2). 

If the transformation is of such a nature that the x's can be obtained 
from the y's by merely rotating the co-ordinate axes, then the transforma- 
tion is an orthogonal transformation or rotational transformation. In order 
that a transformation shall be orthogonal, it is evidently necessary that the 
x's be at the same distance from the origin as the corresponding y's because 
the distance of a point from the origin remains invariant when the co-ordi- 
nate axes are rotated. It is also necessary that the angular separations, or 
scalar products, be invariant, because the configuration of the points is not 
altered by rotating the co-ordinate axes. 

The matrix A of Table His called the matrix of the transformation when it 
is regarded as the relation by which the points y are changed into the 
points x. A linear transformation is represented in the more general form 
by the square matrix of Table 21. It can be shown that a square matrix is 

Table 21 

$a_{11}$   $a_{12}$   $a_{13}$   ...   $a_{1n}$ 
$a_{21}$   $a_{22}$   $a_{23}$   ...   $a_{2n}$ 
$a_{31}$   $a_{32}$   $a_{33}$   ...   $a_{3n}$ 
  ...       ...       ...     ...    ... 



orthogonal, i.e., that it has the effect of rotating a set of points y into a set 
of points x, with the same configuration as the y's, if it satisfies the following 
conditions: 

1) The sum of the squares of the elements in each row is equal to 
unity, i.e., 

(46) $\sum_{j=1}^{n} a_{ij}^2 = 1$ , 

and 

2) The cross product of every pair of rows, i and l, is equal to zero, i.e., 

(47) $\sum_{j=1}^{n} a_{ij} a_{lj} = 0$ , 

when $i \neq l$. It is immaterial whether the rotation is conceived as a rotation 
of the orthogonal co-ordinate axes in a fixed configuration of points or as a 
rotation of the configuration in a fixed reference frame of the co-ordinate 
axes. The result is the same. 

The two conditions, (46) and (47), are of such frequent occurrence that 
it is sometimes convenient to combine them in a single statement. This can 
be done by writing the conditions in the more condensed form 



(48) $\sum_{j=1}^{n} a_{ij} a_{lj} = \delta_{il}$ , 

where the symbol $\delta_{il}$ is known as Kronecker's delta. It is defined as follows: 

$\delta_{il} = +1$ when $i = l$ , 
$\delta_{il} = 0$ when $i \neq l$ . 

It can be seen that with this definition of $\delta_{il}$, the single statement (48) 
covers the two statements (46) and (47). 

If the matrix of a transformation is square and if it satisfies (48), then the 
following conditions are also satisfied: 

3) The sum of the squares of the elements in each column is unity, i.e., 

(49) $\sum_{i=1}^{n} a_{ij}^2 = 1$ . 

4) The cross product of any pair of columns is zero, i.e., 

(50) $\sum_{i=1}^{n} a_{ij} a_{ik} = 0$ $(j \neq k)$ , 

where j and k refer to columns. 

5) The determinant of the transformation is $\pm 1$, i.e., 

(51) $|A| = \pm 1$ . 



If an orthogonal co-ordinate axis is reversed in direction, then the 
corresponding co-ordinate for each point is reversed in sign. If an odd number of 
orthogonal co-ordinate axes are reversed in direction in an orthogonal 
transformation, then the determinant of the transformation is equal to $-1$. If 
an even number of axes are reversed, the determinant is equal to $+1$. These 
two statements can be made with reference to the configuration. If the 
rotational transformation retains the configuration of the points, the 
determinant of the transformation is equal to $+1$. If the rotation involves a 
symmetric distortion of the configuration, the determinant of the orthogonal 
transformation is equal to $-1$. 
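
The conditions for orthogonality lend themselves to a direct numerical check. The sketch below is illustrative only: the names `is_orthogonal`, `transpose`, and `det2` are hypothetical, the angle of 30 degrees is an arbitrary choice, and the determinant helper is restricted to matrices of order 2 to keep the example short.

```python
import math


def is_orthogonal(a, tol=1e-9):
    """True if a is square and its rows satisfy the Kronecker-delta condition."""
    n = len(a)
    if any(len(row) != n for row in a):
        return False
    for i in range(n):
        for l in range(n):
            cross_product = sum(a[i][j] * a[l][j] for j in range(n))
            delta = 1.0 if i == l else 0.0
            if abs(cross_product - delta) > tol:
                return False
    return True


def transpose(a):
    """Interchange rows and columns, so that the column conditions can be tested."""
    return [list(col) for col in zip(*a)]


def det2(a):
    """Determinant of a matrix of order 2."""
    return a[0][0] * a[1][1] - a[0][1] * a[1][0]


phi = math.radians(30)  # arbitrary illustrative angle
rotation = [[math.cos(phi), -math.sin(phi)],
            [math.sin(phi), math.cos(phi)]]
one_axis_reversed = [[1.0, 0.0],
                     [0.0, -1.0]]  # second axis reversed in direction

print(is_orthogonal(rotation), is_orthogonal(transpose(rotation)))  # True True
print(round(det2(rotation), 6), det2(one_axis_reversed))  # 1.0 -1.0
```

Testing the transposed matrix verifies the column conditions, since the columns of a matrix are the rows of its transpose.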



Table 22 

Matrix A 
         $X_1$    $X_2$ 
a          1      -2 
b          2       3 
c         -1       4 
d          5      -2 
e          4       1 

Matrix G 
         $Y_1$    $Y_2$ 
$X_1$    .866   -.500 
$X_2$    .500    .866 

Matrix B = AG 
         $Y_1$     $Y_2$ 
a        -.134   -2.232 
b        3.232    1.598 
c        1.134    3.964 
d        3.330   -4.232 
e        3.964   -1.134 


Each column of an orthogonal transformation shows the direction cosines 
of one of the new co-ordinate axes, referred to the given co-ordinate axes. 

A rotation of the co-ordinate axes implies that the given configuration 
and the transformed configuration are contained in the same space. The 
rows of a transformation correspond to the dimensions of the given configu- 
ration, and the columns of a transformation correspond to the dimensions 
of the new configuration. If the transformation is merely a rotation of axes, 
it is evident that the matrix of an orthogonal transformation is necessarily 
square. If the matrix of a transformation is of order $m \times n$ where $m \neq n$, 
then the transformation cannot be orthogonal, since the number of dimensions 
of the given configuration and the number of dimensions of the transformed 
configuration are not the same. However, such a matrix may satisfy 
condition (47), and it is then said to be orthogonal by rows. If the condition 
is satisfied for columns, as in (50), instead of for rows, then the matrix is 
said to be orthogonal by columns. 

Table 22 is a numerical example of the rotation of five points in a plane. 
A is a matrix of order $5 \times 2$. The orthogonal transformation G is of order 
$2 \times 2$. The matrix product AG = B is of order $5 \times 2$, and it shows the 
co-ordinates of the same five points with reference to the new rotated 
co-ordinate axes. In Figure 2 the five points have been plotted for the given 
orthogonal co-ordinate axes, $X_1$ and $X_2$, that are implied in A. 

FIGURE 2 

Let it be desired to rotate these axes through an angle $\phi = 30^\circ$. The usual 
formulas for rotation of axes* can be written in the form of a $2 \times 2$ 
transformation as follows: 

$\begin{pmatrix} \cos\phi & -\sin\phi \\ \sin\phi & \cos\phi \end{pmatrix}$ 

* W. F. Osgood and W. C. Graustein, Plane and Solid Analytic Geometry (New York: 
Macmillan Co., 1929), p. 220. 

The first condition (46) gives 

$\cos^2\phi + \sin^2\phi = +1$ , 

and the second condition (47) gives 

$\cos\phi \sin\phi - \cos\phi \sin\phi = 0$ . 

Hence this is an orthogonal transformation in which the other properties 
may be readily verified. 

FIGURE 3 

Substitution of $\phi = 30^\circ$ in the transformation produces the matrix G, and 
the multiplication AG produces B. The numerical values in B may be 
checked in Figure 2, where $Y_1$ and $Y_2$ have been drawn so that the angle 
$Y_1 O X_1 = \phi = 30^\circ$. For example, the co-ordinates of the second point can be 
measured on the graph to be 3.23 and 1.60 for the two rotated axes, while they 
are 2 and 3 for the two original axes. This figure illustrates the geometric 
interpretation of an orthogonal transformation. The two columns of the 
transformation G show the direction cosines of the new orthogonal reference 
vectors $Y_1$ and $Y_2$. 
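
The rotation of Table 22 can be recomputed as a check on the arithmetic. The fragment below is a sketch: the helper `matmul` is hypothetical, the matrix A is entered from the co-ordinates of the five points, and G is built from the rotation formulas with an angle of 30 degrees rather than from the rounded three-decimal entries.

```python
import math


def matmul(a, b):
    """Row-by-column product of two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]


A = [[1, -2], [2, 3], [-1, 4], [5, -2], [4, 1]]  # points a to e on X1, X2

phi = math.radians(30)
G = [[math.cos(phi), -math.sin(phi)],
     [math.sin(phi), math.cos(phi)]]

B = matmul(A, G)  # co-ordinates of the same points on the rotated axes

for label, row in zip("abcde", B):
    print(label, [round(v, 3) for v in row])
# a [-0.134, -2.232]
# b [3.232, 1.598]
# c [1.134, 3.964]
# d [3.33, -4.232]
# e [3.964, -1.134]
```

Because the transformation is orthogonal, the distance of each point from the origin is the same in A and in B, which gives a further check on the computation.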

Oblique transformations 

In Figure 2 the two co-ordinate axes $Y_1$ and $Y_2$ are orthogonal. If it is 
desired to define the n points with reference to oblique reference axes, the 
transformation is effected in a similar way. In Figure 3 the same five points 
are plotted on the $X_1$ and $X_2$ axes which are implied in the given matrix A 
of Table 22. Here the new axes are oblique; $Z_1$ is rotated $30^\circ$ from $X_1$, and 
$Z_2$ is rotated $45^\circ$ from $X_2$. 

In Table 23 are shown the numerical values for the corresponding oblique 
transformation. The matrix A contains the co-ordinates of the given five 
points in a space defined by the two orthogonal reference axes $X_1$ and $X_2$. 
The matrix H is a square matrix of order $2 \times 2$. It is the matrix of the oblique 
transformation. Its columns show the direction cosines of the new oblique 
co-ordinate axes $Z_1$ and $Z_2$. These direction cosines may be verified in 
Figure 3. 

Table 23 

Matrix A 
   $X_1$    $X_2$ 
     1      -2 
     2       3 
    -1       4 
     5      -2 
     4       1 

Matrix H 
   $Z_1$     $Z_2$ 
  +.866   -.707 
  +.500   +.707 

Matrix C = AH 
   $Z_1$     $Z_2$ 
  -.134   -2.121 
  3.232     .707 
  1.134    3.535 
  3.330   -4.949 
  3.964   -2.121 

It should be noted that the sum of the squares of each column of H is 
equal to unity. Each column of H may be regarded as defining a unit vector 
in a space of two dimensions. The cross product of the columns of 
H is 

$(.866)(-.707) + (.500)(.707) = -.259$ , 

which is the cosine of the angle between $Z_1$ and $Z_2$. 

The product AH = C shows the projection of each of the five points on 
each of the oblique axes $Z_1$ and $Z_2$. This interpretation can be verified by 
actual measurement on Figure 3. 
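
The oblique transformation can be checked in the same manner as the orthogonal one. In the sketch below the helper `matmul` is hypothetical, and H is entered with the three-decimal values of Table 23 so that the products agree with the tabled entries to the last figure; entering the cosines to more places would alter some third decimals slightly.

```python
import math


def matmul(a, b):
    """Row-by-column product of two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]


A = [[1, -2], [2, 3], [-1, 4], [5, -2], [4, 1]]
H = [[0.866, -0.707],
     [0.500, 0.707]]  # columns: direction cosines of Z1 and Z2

C = matmul(A, H)  # projections of the five points on Z1 and Z2

# Cross product of the two columns of H: cosine of the angle between Z1 and Z2.
cos_z = sum(H[i][0] * H[i][1] for i in range(2))
angle = math.degrees(math.acos(cos_z))

print([[round(v, 3) for v in row] for row in C])
print(round(cos_z, 3), round(angle))  # -0.259 105
```

The angle of 105 degrees agrees with the construction of the axes: $Z_1$ lies 30 degrees from $X_1$, and $Z_2$ lies 45 degrees beyond $X_2$, that is, 135 degrees from $X_1$.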



CHAPTER I 

THE FACTOR PROBLEM 
On the nature of science 

This volume is concerned with methods of discovering and identifying 
significant categories in psychology and in other social sciences. It is there- 
fore of interest to consider some phases of science in general that bear on 
the problem of finding a methodology for a psychological science. 

It is the faith of all science that an unlimited number of phenomena 
can be comprehended in terms of a limited number of concepts or ideal 
constructs. Without this faith no science could ever have any motivation. 
To deny this faith is to affirm the primary chaos of nature and the con- 
sequent futility of scientific effort. The constructs in terms of which nat- 
ural phenomena are comprehended are man-made inventions. To dis- 
cover a scientific law is merely to discover that a man-made scheme serves 
to unify, and thereby to simplify, comprehension of a certain class of natu- 
ral phenomena. A scientific law is not to be thought of as having an inde- 
pendent existence which some scientist is fortunate to stumble upon. A 
scientific law is not a part of nature. It is only a way of comprehending 
nature. 

A simple example is the concept "force." No one has ever seen a force. 
Only the movement of objects is seen. The faith of science is that some 
schematic representation is possible by which complexities of movement 
can be conceptually unified into an order. The error of a literal interpreta- 
tion of a force vector as the pictorial representation of a corresponding 
physical entity is seen in the resolution of forces. If a particle moves with 
uniform acceleration in a certain direction, it is, of course, possible to de- 
scribe the movement by one force, or by two, or by three or more coplanar 
forces. This resolution of a movement into several simultaneous and super- 
imposed movements is frequently done in order that a convenient and habit- 
ual reference frame may be retained. While the ideal constructs of science 
do not imply physical reality, they do not deny the possibility of some de- 
gree of correspondence with physical reality. But this is a philosophical 
problem that is quite outside the domain of science. 

Consider, as another example, Coulomb's inverse-square law of electrical 
attraction. A postulated force is expressed as a function of the linear sepa- 
ration of the charges. Now, if the charges were to be personified, they would 
probably be much surprised that their actions were being described in terms 
of their linear separations. No one assumes that there is a string between 
the charges, but Coulomb's law implies that the length of such a string is to 
be used in our simplified scheme of comprehending the postulated charges. 
It is more likely that the whole space surrounding the charges is involved in 
the phenomena of attraction and that Coulomb's law is a fortunate short- 
cut for representing approximately a part of the phenomena that are called 
charges and attractions. It is not unlikely that all of these entities will 
eventually vanish as such and become only aspects of an order more in- 
volved than Coulomb's law implies but not so chaotic as to individualize 
completely every moment of nature. 

A science of psychology will deal with the activities of people as its cen- 
tral theme. A large class of human activity is that which differentiates in- 
dividuals as regards their overt accomplishments. Just as it is convenient 
to postulate physical forces in describing the movements of physical ob- 
jects, so it is also natural to postulate abilities and their absence as primary 
causes of the successful completion of a task by some individuals and of the 
failure of other individuals in the same task. 

The criterion by which a new ideal construct in science is accepted or re- 
jected is the degree to which it facilitates the comprehension of a class of 
phenomena which can be thought of as examples of a single construct 
rather than as individualized events. It is in this sense that the chief object 
of science is to minimize mental effort. But in order that this reduction shall 
be accepted as science, it must be demonstrated, either explicitly or by im- 
plication, that the number of degrees of freedom of the construct is smaller 
than the number of degrees of freedom of the phenomena that the reduction 
is expected to subsume. Consider, as an example, any situation in which a 
rational equation is proposed as the law governing the relation between two 
variables. If three observations have been made and if the proposed equa- 
tion has three independent parameters, then the number of degrees of free- 
dom of the phenomena is the same as the number of degrees of freedom of 
the equation, and hence the formulation remains undemonstrated. If, on 
the other hand, one hundred experimentally independent observations are 
subsumed by a rational equation with three parameters, then the demon- 
stration can be of scientific interest. The convincingness of a hypothesis 
can be gauged inversely by the ratio of its number of degrees of freedom to 
that of the phenomena which it has demonstrably covered. It is in the na- 
ture of science that no scientific law can ever be proved to be right. It can 
only be shown to be plausible. The laws of science are not immutable. They 
are only human efforts toward parsimony in the comprehension of nature. 

If abilities are to be postulated as primary causes of individual differences 
in overt accomplishment, then the widely different achievements of 
individuals must be demonstrable functions of a limited number of reference 
abilities. This implies that individuals will be described in terms of a limited 
number of faculties. This is contrary to the erroneous contention that since 
every person is different from every other person in the world, people must 
not be classified and labeled. 

Each generalization in the scientific description of nature results in a loss 
in the extent to which the ideal constructs of science match the individual 
events of experience. This is illustrated by simple experiments with a 
pendulum in which the mass, the period, and the locus of the center of gravity 
with reference to a fulcrum are involved in the ideal construct that leads 
to experimental verification. But the construct matches only incompletely 
the corresponding experimental situation. The construct says nothing about 
the rusty set screw and other extraneous detail. From the viewpoint of 
immediate experience, scientific description is necessarily incomplete. The 
scientist always finds his constructs immersed in the irrelevancies of 
experience. It seems appropriate to acknowledge this characteristic of science in 
view of the fact that it is a rather common notion that the scientific 
description of a person is not valid unless the so-called "total situation" has 
been engulfed. A study of people does not become scientific because it 
attempts to be complete, nor is it invalid because it is restricted. The scientific 
description of a person will be as incomplete from the viewpoint of common 
sense as the description of other objects in scientific context. 

The development of scientific analysis in a new class of phenomena 
usually meets with resistance. The faith of science that nature can be 
comprehended in terms of an order acknowledges no limitation whatever as regards 
classes of phenomena. But scientists are not free from prejudice against the 
extension of their faith to realms not habitually comprehended in the 
scientific order. Examples of this resistance are numerous. It is not 
infrequent for a competent physical scientist to declare his belief that the 
phenomena of living objects are, at least in some subtle way, beyond the reach 
of rigorous scientific order. 

One of the forms in which this resistance appears is the assertion that, 
since a scientific construct does not cover all enumerable detail of a class 
of phenomena, it is therefore to be judged inapplicable. Since the analysis 
of cell growth by mathematical and physical principles does not cover 
everything that is known about cells, the biologist judges the analysis to be 
inapplicable. Since no mathematical analysis that can be conceived would cover 
all the subtle mysteries of personality, this realm is frequently judged to be 
outside the domain of rigorous science. But physical scientists accept 
rigorous scientific analyses about physical events that leave fully as much beyond 
the scientific constructs. Every explosion in the world has been different 
from every other explosion, and no physicist can write equations to cover 
all of the detail of any explosive event. It is certain that no two thunder- 
storms have been exactly alike, and yet the constructs of physics are applied 
in comprehending thunder and lightning without any demand that the de- 
tail of the landscape be covered by the same scientific constructs. 

The attitudes of people on a controversial social issue have been appraised 
by allocating each person to a point in a linear continuum as regards his 
favorable or unfavorable affect toward the psychological object. Some so- 
cial scientists have objected because two individuals may have the same 
attitude score toward, say, pacifism, and yet be totally different in their 
backgrounds and in the causes of their similar social views. If such critics 
were consistent, they would object also to the statement that two men have 
identical incomes, for one of them earns when the other one steals. They 
should also object to the statement that two men are of the same height. 
The comparison should be held invalid because one of the men is fat and 
the other is thin. This is again the resistance against invading with the 
generalizing and simplifying constructs of science a realm which is habitual- 
ly comprehended only in terms of innumerable and individualized detail. 
Every scientific construct limits itself to specified variables without any 
pretense to cover those aspects of a class of phenomena about which it has 
said nothing. As regards this characteristic of science, there is no difference 
between the scientific study of physical events and the scientific study of 
biological and psychological events. What is not generally understood, even 
by many scientists, is that no scientific law is ever intended to represent 
any event pictorially. The law is only an abstraction from the experimental 
situation. No experiment is ever completely repeated. 

There is an unlimited number of ways in which nature can be compre- 
hended in terms of fundamental scientific concepts. One of the simplest 
ways in which a class of phenomena can be comprehended in terms of a 
limited number of concepts is probably that in which a linear attribute of 
an event is expressed as a linear function of primary causes. Even when the 
relations are preferably non-linear and mathematically involved, it is fre- 
quently possible to use the simpler linear forms as first approximations. A 
well-known example of this type of relation is that in which the chroma of a 
spectral color is expressed as a linear function of two arbitrarily chosen pri- 
maries. If two spectral colors are chosen arbitrarily for use as primaries, 
it is possible to express any intermediate color as a linear function of the 
two arbitrarily chosen primaries. The coefficients of the two terms of this 
linear function represent the angular sizes of the two sectors into which a 
color rotator is divided. When the rotator is spun, the intermediate color 
is seen. But here, as elsewhere in science, although the chroma of the 
resulting color is expressed in terms of the linear function of the arbitrary 
primaries, it does not follow that the saturation or gray-value is expressed by 
the same law. There is still debate about which colors are to be considered 
primary. This question can be settled only by discovering that a certain set 
of primaries gives the most parsimonious comprehension of some phase of 
color vision. A parallel in the description of human traits is their descrip- 
tion, in first approximation, as linear functions of a limited number of ref- 
erence traits. The final choice of a set of primary reference traits or faculties 
must be made in terms of the discovery that a particular set of reference 
traits renders most parsimonious our comprehension of a great variety of 
human traits. 

Psychological postulates and definitions 

The factorial methods have been developed primarily for the purpose of 
analyzing the relations of human traits. These are defined as follows: 

Definition 1. A trait is any attribute of an individual. 

The factorial methods are applicable also in the analysis of attributes of 
inanimate members of a group. The members of a statistical population 
may be moments in time or regions in space or any other entities, each of 
which has a set of attributes. This generalization will not be made explicit- 
ly, but it is implied in the following chapters. Since the methods have been 
developed primarily with psychological categories in mind, these will be 
explicitly discussed even though the same methods are applicable to prob- 
lems which involve the attributes of inanimate members of a statistical 
group. 

It is useful to distinguish between those traits which are descriptive of 
the individual as he appears to others and those traits which are exemplified 
primarily in the things that he can do. This distinction is involved in the 
definition of "ability." 

Definition 2. An ability is a trait which is defined by what an individual 

can do. 

This definition implies that there are as many abilities as there are enumer- 
able things that individuals can do. Each ability is therefore objectively de- 
fined in terms of a specified task and of a specified method of appraising it. 

Definition 3. The task, together with the method of appraising it, which 
defines an ability is called a test. 

Definition 4. The linear evaluation of a test performance is called a score. 

It is implied in these definitions that an index of ability is covariant with 
the score in the test which defines the ability, and that a true index of 
ability is covariant with the true score in the test. 

Let there be N individuals in an experimental population, and let there 
be n tests. Let $S_{ji}$ be the raw score of individual i in test j, and let $E_{ji}$ be 
the absolute variable error of the score. Then the true score $T_{ji}$ of individual 
i in test j is defined by the relation 

$T_{ji} = S_{ji} - E_{ji}$ . 

In psychological investigations it is sometimes desirable to postulate 
that the frequency distribution of the indices of a particular ability is nor- 
mal in the experimental population, while in some investigations this is not 
a desirable restriction. Hence two indices of ability will be defined in ac- 
cordance with these two cases. Both indices are so defined as to be covariant 
with the true score in the test which defines the ability, but they differ as 
regards the assumption of normality of the distribution of ability in the ex- 
perimental population. 

Case 1, assuming that the distribution of ability is not necessarily Gaussian: 
Let $V_{ji} = a_j T_{ji} + b_j$, in which the parameter $b_j$ and the positive parameter $a_j$ 
are so chosen that the following conditions are satisfied: 

1) $\sum_{i=1}^{N} V_{ji} = 0$ , 

2) $\sum_{i=1}^{N} V_{ji}^2 = N$ . 

Then $V_{ji}$ is an index of the ability j in individual i which is a linear function 
of the true score $T_{ji}$ in the test j. Since this index is a linear function of the 
true score, it follows that the shape of the frequency distribution of true 
scores is retained in the frequency distribution of indices of the 
corresponding ability. This index will be called the standard score in ability j. 
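
The two conditions of Case 1 fix the parameters of the linear function: the multiplier is the reciprocal of the standard deviation of the true scores, and the additive constant is the negative of their mean divided by that standard deviation. The sketch below illustrates this; the function name `standard_scores` and the five scores are hypothetical, and the standard deviation is computed with N, not N - 1, in the denominator so that the sum of squares equals N.

```python
import math


def standard_scores(true_scores):
    """Linear transform a*T + b with sum(V) = 0 and sum(V**2) = N."""
    n = len(true_scores)
    mean = sum(true_scores) / n
    sd = math.sqrt(sum((t - mean) ** 2 for t in true_scores) / n)
    if sd == 0:
        raise ValueError("all scores are equal; no positive multiplier exists")
    a = 1.0 / sd      # positive multiplier
    b = -mean / sd    # additive constant
    return [a * t + b for t in true_scores]


scores = [12.0, 15.0, 9.0, 20.0, 14.0]  # hypothetical true scores in one test
v = standard_scores(scores)

print(round(sum(v), 6) == 0, round(sum(x * x for x in v), 6))  # True 5.0
```

Since the transformation is linear with a positive multiplier, the order of the individuals and the shape of the distribution are unchanged.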
Case 2, assuming that the distribution of ability is Gaussian: 
Let $\phi(T_{ji}) = t_{ji}$ be the monotonic increasing function which satisfies the 
three following conditions: 

1) $\sum_{i=1}^{N} t_{ji} = 0$ , 

2) $\sum_{i=1}^{N} t_{ji}^2 = N$ , 

3) the frequency distribution of $t_{ji}$ in N is Gaussian. 

The index $t_{ji}$ will be called the normalized standard score in ability j or simply 
the standard score in ability j. It is assumed that in each investigation in 
which factorial methods are used, the statement will be explicitly made as 
to whether a Gaussian distribution of ability has been assumed for the 
experimental population. The frequency distribution of raw scores in 
psychological tests is arbitrary, since the scores can be adjusted as regards 
skewness in any desired direction and to any desired extent by merely inserting 
or removing relatively easy or relatively difficult test items. Since this is a 
matter of judgment on the part of the person who assembles the psychologi- 
cal tests, no inference can be made concerning the skewness or normality of 
the distribution of a particular ability from the arbitrary skewness or arti- 
ficial normality of the distribution of raw scores. 
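
One way in which a normalized standard score might be obtained in practice is sketched below in Python. The text does not prescribe a procedure, so everything here is an assumption made for illustration: the function name `normalized_standard_scores`, the use of ranks with the plotting position (rank - 0.5)/N, the requirement that no two scores be tied, and the sample data. With a finite N the resulting distribution can be Gaussian only approximately.

```python
import math
from statistics import NormalDist

def normalized_standard_scores(true_scores):
    """Monotonic increasing map of scores onto approximately Gaussian indices."""
    n = len(true_scores)
    if len(set(true_scores)) != n:
        raise ValueError("this sketch assumes that no two true scores are tied")
    # Indices of the individuals, ordered from lowest to highest true score.
    order = sorted(range(n), key=lambda i: true_scores[i])
    unit_normal = NormalDist()
    t = [0.0] * n
    for rank, i in enumerate(order, start=1):
        # Assumed plotting position; other conventions exist.
        t[i] = unit_normal.inv_cdf((rank - 0.5) / n)
    # Rescale so that the sum is zero and the sum of squares is N.
    mean = sum(t) / n
    sd = math.sqrt(sum((value - mean) ** 2 for value in t) / n)
    return [(value - mean) / sd for value in t]

# Hypothetical, strongly skewed true scores.
skewed_true_scores = [1.0, 2.0, 4.0, 8.0, 16.0, 32.0, 64.0]
t = normalized_standard_scores(skewed_true_scores)
```

The skewness of the input disappears in `t`, while the rank order of the individuals is preserved.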

It is desirable to develop the factorial methods in such a manner that they 
are independent of the assumption of normality of ability in any particular 
experimental population. In the present theoretical development of the factorial 
methods it will not be assumed that any of the distributions of ability are normal. 

The application of the factorial methods in science rests on a fundamental 
postulate. 

Postulate. The standard scores of all individuals in an unlimited number of 
abilities can be expressed, in first approximation, as linear functions 
of their standard scores in a limited number of abilities. 

The correlation between the true scores in two tests will be referred to as 
the correlation between the two abilities which are defined by the tests. In 
statistical work it is customary to refer to two variables as independent 
when their correlation is zero. The term independence will be used with three 
different meanings. They will be designated by appropriate adjectives un- 
less the context makes the designation unnecessary. 

Definition 5. A set of n abilities are linearly independent if the rank of the
matrix of their true interpretations is n. 

Definition 6. Two abilities are statistically independent in a population if 
their correlation is zero in that population. 

Definition 7. Two observations are experimentally independent if they are 
experimentally distinct, so that one is not derived from the other by a 
constraint either of the experimental situation or of the computations. 

In one sense, no two observations can ever be experimentally independ- 
ent. The term can be used only with reference to the state of knowledge 
at the time the observations are made. 

It is clarifying to interpret geometrically the relations of abilities. In 
such a context two abilities that are uncorrelated in a population will be 
called orthogonal in that population. Two abilities that are correlated in a 
population will be called oblique in that population. 

There is special interest in the limited number of abilities in terms of 
which all other abilities can be defined, since these are the landmarks in 
terms of which all abilities can be comprehended.

Definition 8. If the standard scores of N individuals in n abilities are
expressed as linear functions of their scores in r linearly independent
abilities, where r < n, then the r abilities will be called reference abilities.

It will be shown that if a battery of tests can be described with reference
to r orthogonal abilities, there exists an infinite number of sets of r orthogo- 
nal abilities in terms of which the description can be made with equal ac- 
curacy. An arbitrary set of r orthogonal abilities may be chosen for pur- 
poses of description. These are the statistically independent or orthogonal 
reference abilities. If a battery of tests can be described in terms of r or- 
thogonal reference abilities, the tests can also be described by a set of r 
oblique reference abilities. It is not necessary that a reference ability be 
represented by a test in which it is involved exclusively. While each of the 
tests that are used in experimental work defines an ability, it may happen 
that the reference abilities in terms of which tests and individuals are de- 
scribed are not represented by actual tests but by linear combinations of 
several tests. A linear combination of tests may be thought of as a com- 
posite test. 
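
The statement that infinitely many sets of orthogonal reference abilities serve equally well can be illustrated numerically. The sketch below, in Python, assumes a hypothetical table of loadings of four tests on two orthogonal reference abilities and relies on a relation developed later in the chapter, namely that the products of the loadings of two tests, summed over the common factors, reproduce their correlation. The helper names `matmul` and `transpose`, the loadings, and the angle of rotation are all illustrative assumptions.

```python
import math

def matmul(a, b):
    """Row-by-column product of two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(a):
    return [list(column) for column in zip(*a)]

# Hypothetical loadings of four tests on two orthogonal reference abilities.
F = [[0.8, 0.0],
     [0.6, 0.3],
     [0.0, 0.7],
     [0.4, 0.5]]

# Any rotation of the reference frame gives another orthogonal set.
theta = math.radians(30.0)
T = [[math.cos(theta), -math.sin(theta)],
     [math.sin(theta),  math.cos(theta)]]

G = matmul(F, T)                          # loadings on the rotated abilities
products_F = matmul(F, transpose(F))      # sums of products before rotation
products_G = matmul(G, transpose(G))      # sums of products after rotation

max_difference = max(abs(products_F[j][k] - products_G[j][k])
                     for j in range(4) for k in range(4))
```

The loadings in `G` differ from those in `F`, yet `max_difference` is zero apart from rounding, so the two descriptions are equally accurate. Since the angle is arbitrary, the number of such descriptions is unlimited.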

The n × r matrix of coefficients of the r reference abilities in terms of
which the standard scores in each of the n abilities can be linearly expressed 
is not unique. The most parsimonious comprehension of the n abilities in 
terms of r reference abilities is obtained when the number of vanishing 
coefficients of the n linear functions is maximized. 

Definition 9. If the N standard scores in each of n diversified abilities can
be expressed as linear functions of fewer than r of the r independent
reference abilities, then that set of r reference abilities for which the
number of vanishing coefficients is a maximum will be called primary
abilities.

If a large and diversified battery of tests can be described in terms of r ref- 
erence abilities and if a particular set of r primary abilities can be found 
such that each test can be described in terms of less than r of these abilities, 
then the primary abilities have significance because of their identification 
with phenomena extraneous to the test scores and their intercorrelations 
even if the extraneous phenomena are unknown. 

It is conceivable, and not improbable, that some reference abilities will 
be found to be sufficiently elemental that they can be declared to be either 
present or absent in each individual without intermediate gradations in 
amount or degree of presence. 

Definition 10. If only two numerical values occur in the population N for
the standard scores in a primary ability, then the primary ability is a
unitary ability.

This is a genetic interpretation of factors.

The underlying idea from which the present factorial analysis originates
is a very simple one. If there are N individuals in a random sample of the 
population and if each of these individuals has demonstrated his abilities 
by doing his best on n separate tests, then there will be nN test scores to be 
explained. At the present stage of development of psychology and of ge- 
netics there are no available ideal constructs for representing the mental 
abilities. The simplest possible formulation seems to be an analysis of the 
variance of each test into linear components.* It is almost certain that this 
simple type of analysis will not be the ultimate one, but it is likely that the 
principal primary abilities will be discovered by factorial analysis of the 
variance of each test. As soon as some of the primary abilities have been 
isolated, detailed studies of inheritance should be undertaken. 

The performance of an individual on a test is determined in part by the 
abilities that are called for by the test and in part by the degree to which 
the individual possesses these abilities. An individual's performance on a 
test may be regarded as a sum of the contributions of his primary abilities. 
His abilities are not called for to the same extent by the different tests, and 
it therefore seems natural to describe each of his test scores as a sum of 
weighted linear contributions of his different primary abilities. The weights 
are descriptive of the tests. This simple formulation of the problem is 
flexible enough to serve the descriptive purposes of psychology until more 
refined, and perhaps less obvious, constructs will be called for by future ex- 
perimental inquiry and by the attainment of more accurate psychological 
measurement than now seems to be possible. 

The assumption that a performance can be described approximately as a
sum of weighted linear contributions of several independent factors can be
represented in the following equation:†

(1) $a_{j1}x_{1i} + a_{j2}x_{2i} + a_{j3}x_{3i} + \cdots + a_{jq}x_{qi} = s_{ji}$ ,

in which $s_{ji}$ represents the standard score of individual i in test j. The x's
represent standard scores in the q statistically independent arbitrary
reference abilities, while the a's represent factor loadings in the tests. The x's
describe the individuals, and the a's describe the tests. The first term
represents the contribution of the arbitrary reference ability No. 1 to the test
performance $s_{ji}$. It is determined by the amount of the first arbitrary
reference ability that the subject possesses, namely, $x_{1i}$, and the extent to
which the test calls for the first ability, namely, $a_{j1}$. Similar reasoning
applies to the contribution of each other ability to the test performance. If
the primary abilities are oblique, these orthogonal reference abilities may
be regarded as arbitrary. They can be rotated or transformed into the
primary abilities by methods that will be described.

* The variance is the square of the standard deviation.
† "Multiple Factor Analysis," p. 409.

There is no loss of generality in reducing all performances to standard 
scores. This reduction involves a translation of the origin of the raw test 
scores to the mean score of the distribution and a stretching of the scale so 
as to make the standard deviation unity. The shape of the distribution is 
not altered by reducing the raw scores to standard scores. 

It should be noted that, even if each individual can be described in terms 
of a limited number of independent reference abilities, it is still possible for 
every person to be different from every other person in the world. Each 
person might be described in terms of his standard scores in a limited num- 
ber of independent abilities. The number of permutations of these scores 
would probably be sufficient to guarantee the retention of individualities. 

With a limited number of abilities this formulation not only allows that 
every person shall be different from every other person but it also allows 
the widest possible differences between several individuals who attain the 
same objective performance in a test. This may be readily seen by consider- 
ing a hypothetical example. Assume that a test calls for two abilities, such 
as ability in abstraction and ability in the manipulation of numbers. Several 
individuals try the test and attain the same score. One of them may possess 
a high degree of ability in making the abstractions involved in the test, but 
he may be slow in numerical manipulation. Another may be slow in formu- 
lating the abstract part of the problem, but he may make up for this de- 
ficiency by superior numerical speed. The objective result might be the 
same. The purpose of factor analysis is to obtain a quantitative description 
of each primary ability in each individual by means of tasks that require 
these abilities in different amounts. Since every task is probably composite 
in the primary abilities required, it is necessary to make the appraisal of the 
abilities of individuals by analytical methods. This is exactly the object of 
the multiple-factor methods as applied to the problem of describing the 
abilities of people. 
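
The hypothetical example of the two abilities can be put in numbers. In the Python sketch below the loadings of the test and the standard scores of the two individuals are invented for illustration, as are the names `loadings` and `score_on_test`; only the form of the weighted sum comes from equation (1).

```python
# Hypothetical loadings of one test on two abilities.
loadings = {"abstraction": 0.6, "number": 0.8}

def score_on_test(abilities):
    """Weighted sum of an individual's ability scores, as in equation (1)."""
    return sum(loadings[name] * abilities[name] for name in loadings)

# One individual is strong in abstraction but slow in numerical work;
# the other is weak in abstraction but quick with numbers.
first_person = {"abstraction": 1.5, "number": -0.5}
second_person = {"abstraction": -0.5, "number": 1.0}

first_score = score_on_test(first_person)
second_score = score_on_test(second_person)
```

Both individuals obtain the same test score (0.5, apart from rounding) although their ability profiles are quite different, which is why a single test cannot by itself appraise the separate abilities.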

Factor analysis is reminiscent of faculty psychology. It is true that the 
object of factor analysis is to discover the mental faculties. The severe re- 
strictions that are imposed by the logic of factor analysis make it an arduous 
task to isolate each new mental faculty, because it is necessary to prove that 
it is called for by the experimental observations. Factor analysis does not 
allow that a new faculty be added as soon as a new name can be found for 
the things that people can do. In order to prove that reasoning and ab- 
straction are two different faculties, for example, it will be necessary to 
show that the tasks which call for such activities really do involve two fac- 
tors, and not one. 

There is an interesting difference between the logic of multiple correlation
and that of factor analysis. In multiple correlation it is necessary to
designate one of the variables as dependent and all of the others as independent,
and the problem is then to predict the one test score from all the rest. In 
factor analysis there is no problem of prediction of any test scores, and there 
is no distinction between independence and dependence among the given 
variables. The dependent variables are the primary abilities of the indi- 
vidual subjects which are to be estimated in terms of the given tests. 

In the psychology of the future it may be found useful to postulate a dif- 
ferent form of ideal construct for the description of mental endowment than 
the simple one that is implied in equation (1). The ideal constructs of the 
future may involve elements with location in a space frame with spatial, 
dynamic, and temporal constraints analogous to the ideal constructs of 
genetics. It would be unfortunate if some initial success with the analytical 
methods to be described here should lead us to commit ourselves to them 
with such force of habit as to retard the development of entirely different 
constructs that may be indicated by improvements in measurement and by 
inconsistencies between theory and experiment. 

Matrix formulation 

Let N be the number of individuals in a random sample of the population, 
and let n be the number of tests from which the primary abilities are to be 
isolated. The raw data for factorial analysis consist of the entries in an 
n × N table of standard scores in which each of the N subjects is represented
by n test scores. This table will be referred to as an n × N score matrix S.

Equation (1) implies that the matrix S is the product of two matrices, 
namely, one matrix with elements a which are descriptive of the tests and 
another matrix with elements x which are descriptive of the individuals. 
The former will be called the factorial matrix F and the latter the popula- 
tion matrix P. 

In setting up these two matrices, an assumption will be made concerning 
the nature of the factors in the present psychological problem, namely, that 
there are at least three kinds of factors involved in the variance of each test. 
These factors are (a) the common factors or abilities, (b) the specific factors
or abilities, and (c) the chance error factors. By common factor is meant any 
factor or ability which is called for by more than one of the n tests in a 
battery. By specific factor or ability is meant any factor or ability which is 
called for by only one of the n tests. By error factor is meant the variable 
chance error which is a part of the total variance of the test. 

It is evident that an ability which is a common factor for a test in one bat- 
tery may become a part of the specific factor in the same test when it is placed in 
another test battery. Whether any particular ability is common or specific 
depends on the battery as a whole.

In equation (1) test j is defined by the weights or test coefficients $a_{j1}$,
$a_{j2}$, ..., $a_{jq}$. These weights show the extent to which the test calls for each
one of a set of reference abilities. The test coefficients therefore constitute 
a psychological description of the test. It is a fundamental criterion for a 
valid method of isolating primary abilities that the weights of the primary abili- 
ties for a test must remain invariant when it is moved from one test battery to 
another test battery. If this criterion is not fulfilled, the psychological de- 
scription of a test will evidently be as variable as the arbitrarily chosen bat- 
teries into which the test may be placed. Under such conditions no stable 
identification of primary mental abilities can be expected. The factorial 
methods to be presented are consistent with this criterion, and stable iden- 
tification of the primary abilities can therefore be expected. This criterion 
assumes that the several test batteries are given to the same population. 
The primary abilities that define a test in one population should be identical
with the primary abilities which define it in a second population. 

A test may call for two or more abilities that are unique for that test in 
a particular battery. Then the specific variance of the test should be divided 
into parts, one part for each of the several specific abilities. In factorial 
analysis all of the abilities that are specific for a test combine into a single 
variance. 

Table 1 represents a population matrix in which the attributes of each 
member of the population N are recorded with reference to the common 
factors, the specific factors, and the error factors. 
The notation is as follows:
Subscript i refers to a person;
Subscripts j and k refer to tests;
Subscripts m and M refer to common factors.
Let x refer to common factors;
y refer to specific factors;
z refer to error factors.

Let $x_{mi}$ = the standard score of individual i in the common factor m;
$y_{ji}$ = the standard score of individual i in the specific factor of test j;
$z_{ji} = e_{ji}/\sigma_j$, where $e_{ji}$ is the absolute variable error in the standard score
$s_{ji}$, and $\sigma_j$ is the standard error of $s_{ji}$.
Let r = number of common factors;
n = number of tests;
N = number of individuals in a random sample of the population.

An interpretation of the cell entries of the population matrix $P_4$ is then
as follows: Each column is descriptive of one individual. The first r entries
show his standard scores in the r common abilities. The next n entries show
his standard scores in the n specific abilities. It is here assumed that every
psychological test in a finite test battery calls for some specific ability,
although it may be of minor significance. This assumption fits the usual case.
It is only in an unusual test battery (if such exists at all) that the number
of specifics can be smaller than the number of tests. The present analysis
would be essentially the same even for the ideal case in which specifics were
assumed to be absent. The last n cells of each column represent the
variable errors in the n standard test scores of an individual.

Table 1
Population Matrix $P_4$

                               N individuals

    r common factors       x_{11}  x_{12}  x_{13}  ...  x_{1N}
                           x_{21}  x_{22}  x_{23}  ...  x_{2N}
                           x_{31}  x_{32}  x_{33}  ...  x_{3N}
                            ...
                           x_{r1}  x_{r2}  x_{r3}  ...  x_{rN}

    n specific factors     y_{11}  y_{12}  y_{13}  ...  y_{1N}
                           y_{21}  y_{22}  y_{23}  ...  y_{2N}
                           y_{31}  y_{32}  y_{33}  ...  y_{3N}
                            ...
                           y_{n1}  y_{n2}  y_{n3}  ...  y_{nN}

    n error factors        z_{11}  z_{12}  z_{13}  ...  z_{1N}
                           z_{21}  z_{22}  z_{23}  ...  z_{2N}
                           z_{31}  z_{32}  z_{33}  ...  z_{3N}
                            ...
                           z_{n1}  z_{n2}  z_{n3}  ...  z_{nN}

The notation r for the number of common factors may be confusing on 
first sight, since the same letter is used for the coefficient of correlation; but 
the correlation will always be designated with a double subscript for the two 
variables. The notation r is retained for the number of common factors 
since it is a customary notation for the rank of a matrix, and it will be shown 
that the number of common factors is the rank of the correlational matrix.

Since equation (1) implies the product of two matrices, it is of interest to
write both of them. The x's in (1) refer to the population matrix $P_4$ which
is descriptive of the subjects. The a's in (1) are descriptive of the tests. The 
description of the tests can be written in the form of a test matrix or factorial 
matrix, which is shown in Table 2. 



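Table 2
Factorial Matrix $F_4$

               r common factors                 n specific factors      n error factors

    n tests    a_{11} a_{12} a_{13} ... a_{1r}    b_{11}                  c_{11}
               a_{21} a_{22} a_{23} ... a_{2r}       b_{22}                  c_{22}
               a_{31} a_{32} a_{33} ... a_{3r}          b_{33}                  c_{33}
                ...                                        ...                     ...
               a_{n1} a_{n2} a_{n3} ... a_{nr}                b_{nn}                  c_{nn}

    (Cells left blank in the specific and error sections are zero.)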

The additional notation is as follows: 

$a_{jm}$ = loading of the common factor m in test j,
$b_{jj}$ = loading of the specific factor of test j in test j,
$c_{jj}$ = loading of the error factor of test j in test j.

Each row of the factorial matrix describes a test. The first r columns are 
filled, since each test may have a loading in each of the common abilities. 
A common-factor test loading is frequently zero; and this is, in fact, the 
situation that should be explicitly planned for in setting up factorial experi- 
ments. Since, by definition, there is only one specific factor in each test, the 
second section of F is necessarily a diagonal arrangement of the specific 
factor loadings b. The same is true for the error factor loadings c, which 
have a diagonal arrangement. 

It will be assumed that the first r columns of $F_4$ are linearly independent.
This is a postulate concerning the test battery which is represented in $F_4$.

Postulate. The n tests which constitute the battery are so selected that the columns
of the factorial matrix are linearly independent.

It would be difficult to set up a battery which would violate this postu- 
late. It would be a rare occurrence for the columns of F to be depend- 
ent when the number of tests is considerably in excess of the number of 
factors. If such a battery were to be assembled, the factorial solution would 
be a matrix F which reproduced R with less than the true number of
reference abilities. This is probably a remote contingency. The geometrical
representation of such a solution would be a set of n radial vectors, one for each
test, which would lie in a space of a number of dimensions less than the 
number of common factors. 

Tables 1 and 2 show the two matrices whose row-by-column multiplication
is implied in an equation of the type (1). These two matrices are the
population matrix $P_4$ and the test matrix or factorial matrix $F_4$. Rewriting
(1) in matrix notation, we have

(2) $S = F_4 P_4$ .
Inspection of the matrices $F_4$ and $P_4$ reveals that they may be written in
several sections. The population matrix may be written in three sections,
as shown in Table 3. Comparison of Tables 1 and 3 shows that the matrix $P_4$
may be written as the sum of three matrices, namely,

matrix $P_1$ for the common factors,
matrix $P_2$ for the specific factors,
matrix $P_3$ for the error factors,

so that

(3) $P_4 = P_1 + P_2 + P_3$ .

Table 3
Three Components of the Population Matrix

                           Matrix P_1              Matrix P_2               Matrix P_3
                           for common factors      for specific factors     for error factors
                           (N columns)             (N columns)              (N columns)

    r common factors       x_{11} ... x_{1N}
                            ...                         O                        O
                           x_{r1} ... x_{rN}

    n specific factors                             y_{11} ... y_{1N}
                                O                   ...                          O
                                                   y_{n1} ... y_{nN}

    n error factors                                                         z_{11} ... z_{1N}
                                O                       O                    ...
                                                                            z_{n1} ... z_{nN}



The factorial matrix $F_4$ may also be expressed as a sum of three parts in a
similar manner. This is shown in Table 4.

Table 4
Three Components of the Factorial Matrix

                    r common factors        n specific factors       n error factors

    Matrix F_1      a_{11} ... a_{1r}
    (n tests)        ...                         O                        O
                    a_{n1} ... a_{nr}

    Matrix D_1                              b_{11}
    (n tests)            O                     b_{22}                     O
                                                  ...
                                                     b_{nn}

    Matrix D_2                                                       c_{11}
    (n tests)            O                       O                      c_{22}
                                                                           ...
                                                                              c_{nn}

The three parts represent the common factors, the specific factors, and
the error factors. The factorial matrix $F_4$ may be written as a sum of these
three parts, namely,

(4) $F_4 = F_1 + D_1 + D_2$ .

Since, by definition, there is only one specific factor in each test, the middle
section, $D_1$, is a diagonal matrix. By the same reasoning the third section,
$D_2$, is a diagonal matrix in which each entry shows the error factor of a test.
Returning now to equation (2), we may express the standard scores in 
terms of the three kinds of factors. Substituting (3) and (4) in (2), we have 



(5) $S = (F_1 + D_1 + D_2)(P_1 + P_2 + P_3)$ ,
    $S = F_1 P_1 + D_1 P_2 + D_2 P_3$ .
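
The reduction of the product in (5) to three terms can be checked on a small case. The Python sketch below assumes n = 2 tests, r = 1 common factor, and N = 3 individuals, so that there are q = r + 2n = 5 factors. All numerical entries are invented for illustration and are not standardized; only the placement of the zero blocks follows Tables 3 and 4. The helper names `matmul` and `add` are likewise hypothetical.

```python
def matmul(a, b):
    """Row-by-column product of two matrices stored as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def add(*matrices):
    """Element-by-element sum of matrices of equal order."""
    rows, cols = len(matrices[0]), len(matrices[0][0])
    return [[sum(m[i][j] for m in matrices) for j in range(cols)]
            for i in range(rows)]

# Sections of the factorial matrix (2 tests by 5 factors).
F1 = [[0.8, 0, 0, 0, 0],
      [0.6, 0, 0, 0, 0]]          # common-factor loadings
D1 = [[0, 0.48, 0, 0, 0],
      [0, 0, 0.64, 0, 0]]         # specific-factor loadings (diagonal section)
D2 = [[0, 0, 0, 0.36, 0],
      [0, 0, 0, 0, 0.48]]         # error-factor loadings (diagonal section)

# Sections of the population matrix (5 factors by 3 individuals).
P1 = [[1.0, -0.5, -0.5], [0, 0, 0], [0, 0, 0], [0, 0, 0], [0, 0, 0]]
P2 = [[0, 0, 0], [0.2, -1.0, 0.8], [-0.7, 0.3, 0.4], [0, 0, 0], [0, 0, 0]]
P3 = [[0, 0, 0], [0, 0, 0], [0, 0, 0], [0.1, 0.6, -0.7], [-0.3, -0.2, 0.5]]

S_full = matmul(add(F1, D1, D2), add(P1, P2, P3))                 # first line of (5)
S_parts = add(matmul(F1, P1), matmul(D1, P2), matmul(D2, P3))     # second line of (5)
cross_term = matmul(F1, P2)                                       # one of the six vanishing products
```

The two forms of S agree, and a cross product such as `cross_term` consists entirely of zeros because the non-vanishing columns of one section never meet the non-vanishing rows of another.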



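A single element of the n × N matrix S is the standard score of individual
i in test j. It can be written as follows:

(6) $s_{ji} = \sum_{m=1}^{r} a_{jm} x_{mi} + b_{jj} y_{ji} + c_{jj} z_{ji}$ .

By definition, the sum of the standard scores, $\sum_{i=1}^{N} s_{ji}$, of the population N in
test j is equal to zero. The sum may be written as follows:

(7) $\sum_{i=1}^{N} s_{ji} = \sum_{m=1}^{r} a_{jm} \sum_{i=1}^{N} x_{mi} + b_{jj} \sum_{i=1}^{N} y_{ji} + c_{jj} \sum_{i=1}^{N} z_{ji} = 0$ .

By definition, the sum of the squares of the standard scores of the N subjects
must equal N. Then

(8) $\sum_{i=1}^{N} s_{ji}^2 = N$ ,

and hence

(9) $\frac{1}{N} \sum_{i=1}^{N} s_{ji}^2 = 1$ = total variance of test j .

Since the factors are uncorrelated, we have

(10) $\sum_{i=1}^{N} x_{mi} x_{Mi} = \sum_{i=1}^{N} y_{ji} y_{ki} = \sum_{i=1}^{N} z_{ji} z_{ki} = 0$ ,

where m and M refer to any pair of common abilities ($m \neq M$), and where j
and k refer to any pair of tests ($j \neq k$). For the same reason the following
cross products also vanish:

(11) $\sum_{i=1}^{N} x_{mi} y_{ji} = \sum_{i=1}^{N} x_{mi} z_{ji} = \sum_{i=1}^{N} y_{ji} z_{ji} = \sum_{i=1}^{N} y_{ji} z_{ki} = 0$ .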



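Substituting (6) in (9) and ignoring the vanishing cross products of (10)
and (11), we have

(12) $\frac{1}{N}\sum_{i=1}^{N} s_{ji}^2 = \sum_{m=1}^{r} a_{jm}^2 \, \frac{1}{N}\sum_{i=1}^{N} x_{mi}^2 + b_{jj}^2 \, \frac{1}{N}\sum_{i=1}^{N} y_{ji}^2 + c_{jj}^2 \, \frac{1}{N}\sum_{i=1}^{N} z_{ji}^2 = 1$ .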



Since $x_{mi}$ and $y_{ji}$ are standard scores,

(13) $\frac{1}{N}\sum_{i=1}^{N} x_{mi}^2 = \frac{1}{N}\sum_{i=1}^{N} y_{ji}^2 = 1$ .
Since 

it follows that 

(14) ' 
Then,
(15)
and
(16)

Hence

(17) $\frac{1}{N}\sum_{i=1}^{N} z_{ji}^2 = 1$ .

Substituting (13) and (17) in (12), we have

(18) $\sum_{m=1}^{r} a_{jm}^2 + b_{jj}^2 + c_{jj}^2 = 1$ ,

in which the total variance of test j is expressed as a sum of three variances
due to (a) the common factors, (b) the specific factors, and (c) the error
factors. The (r+2) test coefficients in terms of which test j is described are the
r values of $a_{jm}$ and the values of $b_{jj}$ and $c_{jj}$. Equation (18) can be restated
as follows:

Theorem. The sum of the squares of the test coefficients of a test is equal to
unity.

In fact, $a_{jm}^2$, the square of a test coefficient, is that part of the total
variance of a test j which is attributable to the factor m. In the same manner
$b_{jj}^2$ is that part of the variance of the test j which is attributable to the
specific factor in test j. Also, $c_{jj}^2$ is the part of the variance of test j which is
due to the variable chance errors in the scores of test j.
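
The theorem may be verified on an artificial population in which the factors are exactly uncorrelated. The Python sketch below is offered only as an illustration under stated assumptions: N = 8 individuals, two common factors, and factor scores taken from the rows of a matrix of +1 and -1 built by the Sylvester doubling construction, whose rows are mutually orthogonal with zero sum and sum of squares N. The coefficients and the name `hadamard` are hypothetical.

```python
import math

def hadamard(k):
    """Sylvester construction: a 2**k by 2**k matrix of +1/-1 with orthogonal rows."""
    h = [[1]]
    for _ in range(k):
        h = [row + row for row in h] + [row + [-v for v in row] for row in h]
    return h

N = 8
H = hadamard(3)
# Rows 1 to 4 serve as standard scores in two common factors, the specific
# factor, and the error factor of one test (row 0 is constant and is not used).
x1, x2, y, z = H[1], H[2], H[3], H[4]

# Hypothetical test coefficients; c is chosen so that the squares sum to unity.
a1, a2, b = 0.7, 0.5, 0.4
c = math.sqrt(1.0 - a1 ** 2 - a2 ** 2 - b ** 2)

# Test scores built as in equation (6).
s = [a1 * x1[i] + a2 * x2[i] + b * y[i] + c * z[i] for i in range(N)]

sum_of_scores = sum(s)
variance = sum(value * value for value in s) / N
sum_of_squared_coefficients = a1 ** 2 + a2 ** 2 + b ** 2 + c ** 2
```

The scores sum to zero and their variance equals the sum of the squared coefficients, which is unity here, so that `s` is itself a standard score. The check succeeds only because every cross product of the factor scores vanishes.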

Communality

It has been shown that the total variance of a test can be expressed as
the sum of three variances which are due to (a) the abilities which are
common to two or more of the tests, (b) the abilities which are unique in that
they are called for by only one test, and (c) the variable errors. It will be
convenient to name these three parts of the total variance. The following
terminology* will be used:

$\sum_{m=1}^{r} a_{jm}^2 = h_j^2$ = communality of test j ,
$b_{jj}^2 = b_j^2$ = specificity of test j ,
$c_{jj}^2 = c_j^2$ = error variance of test j .

The concept of communality is pivotal in factor analysis, and it will be nec- 
essary to refer to it frequently. 

Definition 11. The communality of a test is its common factor variance.

* "Theory of Multiple Factors," p. 8.

The object of factor analysis is to isolate the r values $a_{jm}$ for each test j
and the r values $x_{mi}$ for each person i.

Since, in factor analysis, the specificity and the error variance combine 
into a single variance that is unique for each test, it will be convenient to 
combine them in the following manner: 

$b_j^2 + c_j^2 = u_j^2$ = uniqueness of test j ,
so that

(19) $h_j^2 + u_j^2 = 1$ .

Equation (19) shows that the variance of a test may be expressed as the 
sum of two parts, namely, the communality and the uniqueness. 

Definition 12. The uniqueness of a test is the complement of its commu- 
nality. 

An interpretation of (19) is that the total variance of a test can be divided 
into two parts: namely, the communality, that part of its variance which 
is due to factors common to other tests in the battery; and the uniqueness, 
that part of its variance which is due to factors not common to other tests 
in the battery. 
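
The division of the total variance can be followed through for a single test. In the Python sketch below the test coefficients are hypothetical and were chosen so that their squares sum to unity; the function name `variance_parts` is likewise only illustrative.

```python
def variance_parts(common_loadings, specific_loading, error_loading):
    """Split the unit variance of a test into the parts named in the text."""
    communality = sum(a * a for a in common_loadings)
    specificity = specific_loading ** 2
    error_variance = error_loading ** 2
    uniqueness = specificity + error_variance
    return {
        "communality": communality,
        "specificity": specificity,
        "error variance": error_variance,
        "uniqueness": uniqueness,
    }

# Hypothetical test with two common-factor loadings.
parts = variance_parts([0.48, 0.36], specific_loading=0.48, error_loading=0.64)
```

For this test the communality is 0.36 and the uniqueness 0.64, of which 0.2304 is specificity and 0.4096 error variance; the communality and the uniqueness together account for the whole variance, as in (19).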

This distinction between communality and uniqueness is crucial in fac- 
torial analysis. If a test calls for several abilities which are unique in that 
they are not called for by any other tests in the battery, then these unique
abilities combine with the error variance into a single specific variance for 
the test. The isolation of these unique abilities and the appraisal of individ- 
uals with reference to them cannot be effected by factorial methods until 
the test is inserted in a battery with other tests that do contain these abili- 
ties. Then the abilities which combine into a single specific factor in one 
battery become separate common factors in the new battery. They can 
then be isolated. 

An object of psychological inquiry is to isolate an increasing number of 
abilities until the specific variance of each important test shall be reduced 
to a minimum. It is not likely that any single test will be completely de- 
scribed in terms of the factors which it has in common with those of one 
battery. In order to isolate all of the abilities that are called for by a test, 
it will probably be necessary to insert it in several test batteries in succes- 
sion. The specific variance of a test should be regarded as a challenge; it is 
that part of the total variance of a test which is unique in a particular bat- 
tery, and hence its factorial composition is unknown. In order to test a hy- 
pothesis concerning the abilities which are involved in the specific variance 
of a test, the test should be combined with others which involve the
hypothetical abilities. If the specific variance is reduced, the hypothesis is
sustained.

For the next few years it will probably be more interesting to isolate new 
abilities than to reduce the specificity in particular tests. Increased knowl- 
edge of the primary mental abilities will facilitate the type of experiment 
by which the specificities of particular tests may be reduced. It will prob- 
ably be found that a considerable fraction of the total variance of each test 
is attributable to factors of such limited social significance that the complete 
elimination of the specificity of each test will not be essential in the early 
stages of the scientific study of human abilities. 

The intercorrelations 

Since $s_{ji}$ are standard scores, the intercorrelation between two tests j
and k can be written in the simple form

(20) $r_{jk} = \frac{1}{N}\sum_{i=1}^{N} s_{ji} s_{ki}$ .

This implies the multiplication of two matrices. The elements of a moment
matrix M may be defined as follows:

(21) $m_{jk} = \sum_{i=1}^{N} s_{ji} s_{ki} = N r_{jk}$ ,

so that we have, in matrix notation,

(22) $M = SS'$ .

Substituting (5) in (22),

(23) $M = (F_1P_1 + D_1P_2 + D_2P_3)(F_1P_1 + D_1P_2 + D_2P_3)'$ ,

(24) $\;\; = (F_1P_1 + D_1P_2 + D_2P_3)(P_1'F_1' + P_2'D_1' + P_3'D_2')$ .

Six of the terms of this product vanish. Hence

(25) $M = F_1P_1P_1'F_1' + D_1P_2P_2'D_1' + D_2P_3P_3'D_2'$ .



The population matrix $P_1$ is orthogonal by rows, as can be seen by reference
to (10) and (13). Hence

(26) $P_1 P_1' = D_3$ ,

in which $D_3$ is a diagonal matrix of order q × q and where
q = 2n + r = total number of factors.

The matrix $D_3$ has the constant element N in the r diagonal cells of the
first r rows and columns, and zero in all other cells.
Similar reasoning applies to $P_2$. Hence

(27) $P_2 P_2' = D_4$ ,

in which $D_4$ is a diagonal matrix of order q × q with constant element N in
the n diagonal cells of the rows and columns (r+1) to (r+n) inclusive, and
zero in all other cells.
By the same reasoning

(28) $P_3 P_3' = D_5$ ,

in which $D_5$ is a diagonal matrix of order q × q with constant element N in
the n diagonal cells of the rows and columns (r+n+1) to (r+2n), inclusive,
and zero in all other cells.

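Substituting (26), (27), and (28) in (25),

(29) $M = F_1 D_3 F_1' + D_1 D_4 D_1' + D_2 D_5 D_2'$

(30) $\;\; = N F_1 F_1' + N D_1 D_1' + N D_2 D_2'$

(31) $\;\; = N (F_1 F_1' + D_1 D_1' + D_2 D_2')$ .

By (20) and (21) we have

(32) $R_1 = \frac{1}{N} M$ ,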

where $R_1$ is a square matrix of order n, the cells of which contain the true
intercorrelations of the fallible tests. From (31) and (32) it follows that

(33) $R_1 = F_1F_1' + D_1D_1' + D_2D_2'$ ,

or

(34) $R_1 = F_1F_1' + D_1^2 + D_2^2$ .



The correlational entries of $R_1$ are as follows:

(35) $r_{jj} = \sum_{m=1}^{r} a_{jm}^2 + b_j^2 + c_j^2 = 1$ ,

(36) $r_{jj} = h_j^2 + b_j^2 + c_j^2 = 1$ ,

and

(37) $r_{jk} = \sum_{m=1}^{r} a_{jm} a_{km}$ , where $j \neq k$ .

Equations (35) and (37) show that the terms $D_1^2$ and $D_2^2$ of (34) affect only
the diagonal entries of $R_1$. If a new matrix R is defined by the relation

(38) $R = F_1 F_1'$ ,

then $R_1$ and R are identical except for the diagonal entries. By (35) the
diagonal entry of $R_1$ is 1. The diagonal entry of R in the column j and row j
is the communality $h_j^2$. The matrix R will be called the reduced matrix of the
true correlations of fallible tests. It will be referred to more briefly as a
"reduced correlational matrix." The matrix $R_1$ will be called the complete
matrix of correlations of fallible tests. It will be referred to as a "complete
correlational matrix" in the sense that the complete variance of each test is
represented by the diagonal entries.

Let F be the matrix formed by the n rows and the first r columns of FI. 
Then, by (38), 

(39) R - ff > , 

in which the reduced correlational matrix is defined in terms of the common 
factors. The matrix F is an nXr-rowed matrix which shows the weights of 
the r common factors in the n tests. This matrix will be called the "matrix 
of the common factors" or, more briefly, the "factorial matrix." Since in 
factorial analysis it is the common factors that are of principal interest, 
there is no confusion in referring to F as the factorial matrix without quali- 
fication for the common factors. 

The reliability coefficient 

It is customary in psychological work to write the reliability coefficient in
the diagonal cells of a correlation matrix. By the present analysis it is seen
that the diagonal entries of $R_1$ are unity, while the diagonal entries of R are
the communalities $h_j^2$. The relation between the reliability and the
communality of a test may be shown by considering in detail the factorial
matrix for a test j, a parallel test j', another test k, and its parallel
test k'. The factorial matrix for these four tests is shown in Table 5.

Let there be r common factors in the four tests. Let $b_j^2$ be the specific
variance in test j. Since j and j' are parallel tests, it is evident that they
must require the same common abilities and the same specific ability. Hence
$b_j$ is recorded in the same column of F for both j and j'. For the same reason
$b_k$ must be common to tests k and k', which are parallel. But the variable
errors are uncorrelated by definition, even for parallel tests. Hence $F_4$ of
Table 5 shows a separate error factor for each of the four tests.

Table 5

Factorial Matrix F for Four Tests, j and Its Parallel j', and k
and Its Parallel k'

                 r common                Two specific      Four error
Test             factors                   factors           factors

j      a_j1  a_j2  a_j3  ...  a_jr        b_j    0        c_j   0     0    0
j'     a_j1  a_j2  a_j3  ...  a_jr        b_j    0        0     c_j'  0    0
k      a_k1  a_k2  a_k3  ...  a_kr        0      b_k      0     0     c_k  0
k'     a_k1  a_k2  a_k3  ...  a_kr        0      b_k      0     0     0    c_k'

The true correlation between the fallible parallel tests k and k' is the
reliability of k. The complete correlational matrix is

(40) $R_1 = F_1F_1' + D_1^2 + D_2^2$ .

But $D_2$ in Table 5 does not contribute to the reliability coefficient $r_{kk'}$,
which is not a diagonal entry. The matrix $D_1$ does contribute to the
reliability coefficient because the specificity is an additional common factor
in the special case of Table 5. Hence

(41) $r_{kk'} = \sum_{m=1}^{r} a_{km}^2 + b_k^2$

or

(42) $r_{kk'} = h_k^2 + b_k^2$ = reliability of test k .

By (36) and (42),

(43) $r_{kk'} = 1 - c_k^2$ .




Equation (43) merely states that the reliability coefficient of a test is the
complement of its error variance.
Since

(44) $h_k^2 = 1 - u_k^2$

and

(45) $u_k^2 = b_k^2 + c_k^2$ ,

it follows that

(46) $h_k^2 \leq r_{kk'}$ .

Theorem. The communality of a test is always smaller than the reliability
except in the limiting case where the specific factor is absent, in which
case the communality and the reliability are equal.

It is of interest to note that the uniqueness cannot be separated into its
two parts, the specificity and the error variance, by factorial methods. In
order to estimate the specific variance of a test, it is necessary to estimate
its reliability by experimental means. The uniqueness can be determined
by factor methods. The specificity is then

(47) $b_k^2 = u_k^2 - c_k^2$ ,

where

(48) $c_k^2 = 1 - r_{kk'}$ .

Since $r_{kk'}$ can be estimated only more or less roughly by various
experimental methods, it is clear that estimates of specific variance are
necessarily equally uncertain.

The terminology for the different parts of the variance of a test is
summarized as follows:

Total variance = $h_k^2 + b_k^2 + c_k^2 = 1$ .

Reliability = $h_k^2 + b_k^2 = 1 - c_k^2$ .

Communality = $h_k^2 = 1 - u_k^2$ .

Specificity = $b_k^2$ .

Uniqueness = $b_k^2 + c_k^2 = u_k^2$ .

Error variance = $c_k^2 = 1 - r_{kk'}$ .
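The partition summarized above can be sketched numerically. The following is a minimal illustration, not part of the original text: the function name `variance_parts`, the two common-factor loadings, and the reliability of .80 are all hypothetical, and the test is assumed to be standardized to unit variance.

```python
def variance_parts(common_loadings, reliability):
    """Partition the unit variance of one standardized test.

    common_loadings: the test's row of F (hypothetical values below).
    reliability:     r_kk', which must be estimated experimentally.
    """
    communality = sum(a * a for a in common_loadings)  # h^2, sum of a_km^2
    uniqueness = 1.0 - communality                     # u^2, equation (44)
    error = 1.0 - reliability                          # c^2, equation (48)
    specificity = uniqueness - error                   # b^2, equation (47)
    if specificity < -1e-12:
        raise ValueError("communality may not exceed reliability, by (46)")
    return {
        "communality": communality,
        "specificity": specificity,
        "error": error,
        "uniqueness": uniqueness,
        "reliability": reliability,
    }


# Hypothetical test with two common-factor loadings and reliability .80.
parts = variance_parts([0.6, 0.5], reliability=0.80)
for name, value in parts.items():
    print(f"{name:12s} {value:.2f}")
```

Only the communality and the uniqueness come from the factorial matrix; the split of the uniqueness into specificity and error variance depends entirely on the reliability supplied from outside, which is the point made in the text.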

The population space 

The population matrix of Table 1 may be regarded as exhibiting N co-ordinates
for each of (r+2n) points in a population space of N dimensions.
Each individual of an infinite population may be regarded as defining an
orthogonal reference vector. The sample of the population may be regarded
as defining a set of orthogonal reference axes in as many dimensions as there
are individuals in the sample. The factors may be regarded as vectors in
the population space. By (13) and (17) it is seen that the (r+2n) factorial
vectors are all at the distance $\sqrt{N}$ from the origin in the population space.
By (10) and (11) these vectors are orthogonal in the same space. Hence the
entries of $P_4$ may be regarded as $\sqrt{N}$ times the direction cosines of (r+2n)
orthogonal factorial vectors in the population space. Since the factors are
represented by orthogonal vectors in the population space, it follows that
the total factor space is a subspace of the population space.

The factorial matrix $F_4$ may be regarded as the co-ordinates of n points
in the same space. By (18) these points are at unit distance from the origin.
The entries of F may be regarded as the direction cosines of n unit test
vectors in the factor space.

The score matrix may be regarded as exhibiting the co-ordinates of each
test in the population space of N dimensions. The cells of the moment
matrix M show N times the scalar products of pairs of test vectors. The
complete correlational matrix $R_1$ shows the scalar products of pairs of test
vectors in the population space, while the reduced correlational matrix R shows
the scalar products of the projections of these test vectors in the
common-factor subspace of the population space. Each test is represented by a
unit vector in the population space. The square of the length of its projection
in the common-factor subspace is its communality.

The common-factor space 

The geometrical representation of the factorial matrix is fundamental in
factor analysis. The factor matrix of Table 2 can be regarded as exhibiting
the (r+2n) co-ordinates of n points in a total factor space of (r+2n)
dimensions. The points may also be regarded as the termini of as many test
vectors. Each test is then a unit vector in the total factor space. The scalar
product of a pair of test vectors is the correlation between the two tests.

Since it is the common factors that are of primary interest in factor
analysis, it is profitable to consider mainly the common-factor space. The
common-factor space is defined by the first r columns of the factorial matrix
$F_4$. It shows the r co-ordinates of each of n tests in a common-factor space
of r dimensions. Here, again, the scalar product of a pair of test vectors is
the correlation between the tests. The correlation is unaffected by the
projections of the test vectors into the specific space and into the error
space because these projections are orthogonal by definition. The length of
each test vector in the common-factor space is the square root of its
communality. The complement of the communality of each test is the square of
the length of its projection in the unique-factor space.



CHAPTER II 
THE FUNDAMENTAL FACTOR THEOREM 

The correlational matrices 

The factor theorem which is basic for the present analysis is equation 
(39-i), namely, 

FF' = R . 

It states, in matrix notation: 

Theorem 1. The product of the factorial matrix and its transpose is the re- 
duced correlational matrix. 

In the theoretical development of this theorem in the previous chapter the
attributes of the individuals and of the tests were chosen as natural starting
points, so that R could be written if F were known. The present scientific
problem is the reverse. It is the intercorrelations, $R_0$, that are known.
The object of the factorial analysis is to find F. The theory, as well as the
statistical methods that are involved in factor analysis, is implied in this
reversal, namely, that when R is given experimentally, the problem is to
find F.

By the definition of a correlation coefficient in (22-i) and (32-i) it follows
that

(1) $R_1 = \frac{1}{N} SS'$ ,

and hence the correlational matrix is symmetric and factorable. It can be
shown that $R_1$ is a positive-definite matrix. From this follows the
fundamental factor theorem:

Theorem 2. For any correlational matrix R there exists a corresponding
factorial matrix F such that FF' = R.

The bold-faced notation R refers to any correlational matrix. It may
have any values in the diagonals which preserve the Gramian properties of
the matrix. Hence R may contain unity or the reliabilities or the
communalities in the diagonal cells. The bold-faced notation F refers to any
factorial matrix which reproduces R.

To write the factorial description of the tests in the form of matrix F
implies, of course, that an orthogonal co-ordinate system is given. In the
reverse problem an interesting indeterminacy appears as regards the
co-ordinate system in that the co-ordinate reference axes are not defined by
the correlational matrix R. It has been shown that the entries of R which are
the true correlations of fallible tests can be regarded as the scalar products
of pairs of vectors. Such a product is a function of the scalars of the two
vectors and the angle of separation between the vectors. But all three of these
quantities are independent of the location of the orthogonal axes of
reference. Rotation of the orthogonal co-ordinates implies:

Theorem 3. An infinite number of matrices F can be written which will re- 
produce a given correlational matrix R. 

In order that a unique solution of F may be found for any given matrix R, 
it will therefore be necessary to impose further restrictions on the solution. 
Such additional criteria are to be found in the psychological considerations 
that govern the problem. 
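Theorem 3 can be checked numerically with a small sketch. The loadings below are hypothetical and restricted to two common factors so that a rotation is described by a single angle; the helper names `gram`, `rotate`, and `max_difference` are illustrative only.

```python
import math


def gram(F):
    """Return F F', the scalar products of all pairs of rows of F."""
    return [[sum(x * y for x, y in zip(row_j, row_k)) for row_k in F]
            for row_j in F]


def rotate(F, degrees):
    """Refer a two-column F to axes turned through the given angle."""
    c = math.cos(math.radians(degrees))
    s = math.sin(math.radians(degrees))
    # Each row (a1, a2) is multiplied by the orthogonal matrix [[c, s], [-s, c]].
    return [[a1 * c - a2 * s, a1 * s + a2 * c] for a1, a2 in F]


def max_difference(A, B):
    """Largest absolute cell-by-cell difference between two matrices."""
    return max(abs(a - b) for ra, rb in zip(A, B) for a, b in zip(ra, rb))


# Hypothetical loadings of three tests on two common factors.
F = [[0.8, 0.1], [0.5, 0.6], [0.2, -0.7]]
R = gram(F)

for degrees in (15, 90, 200):
    G = rotate(F, degrees)
    same_R = max_difference(gram(G), R) < 1e-12   # correlations reproduced
    new_F = max_difference(G, F) > 0.01           # but the entries differ
    print(degrees, same_R, new_F)
```

Every angle yields a different matrix of loadings with the same reduced correlational matrix, which is why further restrictions are needed before a particular F can be preferred.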

Considerable psychological interest attaches to the signs of the co-ordi- 
nates which constitute the entries of F. If the variables are traits of people, 
it is usually possible to ascribe acceptable meanings to both positive and 
negative co-ordinates. If cheerfulness is one of the orthogonal axes, there is 
no difficulty in defining a personality trait as a vector with either positive or 
negative projection on the reference axis of cheerfulness. Thus, grouchiness 
might be a vector with a negative projection on cheerfulness. 

The case seems to be different with those traits which concern the things 
that people can do. These are the traits which have been defined as abilities. 
An individual can, of course, be described as above or below the mean of a 
random sample of the population with regard to any specified ability; but, 
with current psychological concepts, it is preferable to avoid a formulation 
by which a task might have a negative projection on an axis of reference 
which defines an ability. One psychological interpretation would be that 
the performance of such a task is actually facilitated by some sort of ability 
which is less than totally absent! 

Since the signs of the entries in F are of considerable interest, the follow- 
ing theorems will be found useful. 

Theorem 4. The signs of all the entries in a column of F may be changed 
without altering the correlational matrix R. 

This may be seen from the factor theorem 1. It may also be inferred from
the geometrical consideration that the scalar products of R are independent,
not only of the precise locations of the orthogonal co-ordinate axes, but also
of reversal of their direction, as represented by a reversal of sign of the
co-ordinates in a column of F. This geometrical fact has a psychological
counterpart. The correlation between any two traits remains unaffected by the
arbitrary decision to call one of the component reference traits "plus
cheerfulness" or "minus gloominess." The theorem can be inferred algebraically
from (35-i) and (37-i), where it is seen that a change in sign of $a_{jm}$ and
$a_{km}$ for a fixed column m does not alter the value of $r_{jk}$.

Theorem 5. If all of the signs are reversed in a row of F, then all the signs
are reversed in the corresponding row and the corresponding column
of R.

To change the signs in a row of F is to reverse the direction of a test
vector. Its scalar remains the same, while its angular separation $\phi$ from
any other test vector is changed to the supplement of $\phi$. Hence the absolute
values of the correlations of this test with the other tests remain unaltered,
but their signs are reversed. The psychological interpretation can be shown
by an example, namely, that if one variable correlates positively with "plus
tactfulness," then it will correlate negatively with "minus tactfulness,"
which might be defined as "plus tactlessness."

This theorem can also be inferred algebraically from (37-i), where it is
seen that a change in sign of $a_{jm}$ for a test j alters the sign of $r_{jk}$
where $j \neq k$. From (35-i) it is seen that when $a_{km}$ is reversed in sign
for a test k, the value of $r_{jk}$ is not changed in sign for $j = k$. The
self-correlation remains positive for all possible reversals of sign of tests
and factors.
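Theorems 4 and 5 can be illustrated together. In this sketch the loadings are hypothetical, and `flip_column` and `flip_row` are illustrative helper names for the reflection of a factor and of a test, respectively.

```python
def gram(F):
    """Return F F', the scalar products of all pairs of rows of F."""
    return [[sum(x * y for x, y in zip(row_j, row_k)) for row_k in F]
            for row_j in F]


def flip_column(F, m):
    """Reverse the signs of column m of F (reflect one reference factor)."""
    return [[-a if col == m else a for col, a in enumerate(row)] for row in F]


def flip_row(F, j):
    """Reverse the signs of row j of F (reflect one test vector)."""
    return [[-a for a in row] if i == j else list(row)
            for i, row in enumerate(F)]


def expected_sign(j, k, flipped):
    """-1 when exactly one of tests j and k is the reflected test, else +1."""
    return -1 if (j == flipped) != (k == flipped) else 1


# Hypothetical loadings of three tests on two common factors.
F = [[0.8, 0.1], [0.5, 0.6], [0.2, -0.7]]
R = gram(F)
n = len(F)

# Theorem 4: reflecting a factor leaves every correlation unchanged.
R_col = gram(flip_column(F, 0))
theorem_4 = all(abs(R_col[j][k] - R[j][k]) < 1e-12
                for j in range(n) for k in range(n))

# Theorem 5: reflecting test 3 reverses its row and column of R,
# while its diagonal entry stays positive.
R_row = gram(flip_row(F, 2))
theorem_5 = all(abs(R_row[j][k] - expected_sign(j, k, 2) * R[j][k]) < 1e-12
                for j in range(n) for k in range(n))

print(theorem_4, theorem_5)
```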

The number of independent factors 

One of the principal problems in factor analysis is to ascertain the number 
of linearly independent factors that must be postulated in order to describe 
the scores in the tests as linear combinations of the factors. The columns of 
F represent independent factors, so that the number of independent factors 
is the number of columns of F. But this is also the rank of F. It can be 
shown that the ranks of R and of F are always the same. Hence we have:*

Theorem 6. The number of linearly independent factors represented by the
intercorrelations of n tests is equal to the rank of their correlational
matrix R.

* "Theory of Multiple Factors," p. 20.

Owing to sampling errors, the experimentally obtained correlation
coefficients are not the true intercorrelations which are defined as the cell
entries of R. The experimentally obtained correlations constitute a square
matrix of order n which will be designated $R_0$. The distribution of
discrepancies between the experimental values in $R_0$ and the corresponding
true correlations of the fallible tests in R should have a dispersion not
excessively greater than that to be expected from the known standard errors of
the experimental coefficients.

Since the sampling errors in $R_0$ are fortuitous, it should be expected that
the rank of $R_0$ is equal to its order, namely n. The theorem concerning the
number of factors shows that the number of common factors that are required in
order to account exactly for the coefficients in $R_0$ is equal to the
number of tests. Such a solution is of no scientific interest. It corresponds
to the more obvious situation in which the number of parameters in a
hypothesis is equal to the number of observations. In a simple curve-fitting
problem the analogous situation would be that in which a curve with r
independent parameters is fitted to a set of r points. The significance of an
equation so chosen is not convincing. One of the fundamental principles of
science is that the convincingness of a scientific hypothesis varies with the
degree to which it is overdetermined by the data. To postulate as many
reference abilities as there are tests constitutes the absurdity of postulating
as many categories as there are facts to be explained or described. To do
so would be to acknowledge the defeat of scientific effort.

The problem of describing factorially the variables whose experimental
intercorrelations are given in $R_0$ is essentially that of finding another
matrix R (a reduced correlational matrix) of lowest possible rank whose cell
entries do not deviate from those of $R_0$ by more than might be expected from
the sampling errors in the experimental coefficients of $R_0$. If such a matrix
R can be found, in which the rank r<n, a scientifically significant solution F
may be possible. The converse is not necessarily valid, since the present
reasoning is based on a set of postulates which by no means exhaust the
possible ideal constructs in terms of which the variables may be described.
But in any event the number of degrees of freedom of the construct must be
considerably smaller than that of the experimental data that are to be
unified.

In dealing with the experimentally obtained values in the correlational
matrix $R_0$, it must be remembered that the diagonal entries are unknown.
The communalities are numbers between 0 and +1. If the smallest number
of factors in terms of which the scores can be linearly expressed is r, then the
factorial matrix F will have r linearly independent columns. But the number
of columns is then the rank of F. Since the rank of F and the rank of R
are always the same, we have the following theorem:

Theorem 7. The smallest number of independent factors that will account
for the intercorrelations of n tests is the minimum rank of the
correlational matrix with the diagonal entries treated as unknown positive
values between 0 and +1.

Algebraic and configurational uniqueness 

It has been shown that if a factorial matrix F has been found such that
FF' = R, the solution F is not algebraically unique because the co-ordinate
system of F may be rotated arbitrarily without affecting the reproduction
of the correlations in R.* Such rotation alters the numerical entries in F.
It is in this sense that the matrix F is not algebraically unique.

* "Theory of Multiple Factors," p. 10.

The entries in F may be regarded as the orthogonal co-ordinates of n
points in the common-factor space. These points constitute the termini of
the test vectors whose scalar products are shown in R. In this sense both F
and R represent the same configuration. If there exists only one
configuration that will satisfy R, then there is only one configuration that
can be represented by F. Rotation of the co-ordinate system does not alter the
configuration either in R or in F. It is in this sense that F may be a unique
solution to the factor problem which is stated in R. If, on the other hand, the
given matrix R with unknown diagonal entries does not define a unique
configuration, then any corresponding matrix F cannot be unique.

Since the psychological problem consists in describing the abilities that 
are represented in the common-factor space, it seems evident that no psy- 
chologically meaningful solution can be expected unless the given matrix R 
defines a unique configuration in the common-factor space. It is therefore of 
considerable importance to ascertain the conditions under which a unique 
configuration is defined by the given intercorrelations. 

This problem may be clarified by a very simple but extreme example of a 
correlational matrix which does not define a unique configuration. Consider 
a set of two tests. The correlational matrix is of order 2; and it contains only 
one intercorrelation in addition to the two communalities, which are un- 
known. If two abilities are involved, the rank of the correlational matrix 
must be 2. The two diagonals may be given any values between 0 and 1
by which the rank remains 2. Any pair of diagonal values defines the sca- 
lars of the two vectors. The angular separation is determined so that the 
scalar product is equal to the observed intercorrelation of the two tests. 
It is evident that for each pair of arbitrary diagonal values a different con- 
figuration will be obtained. Evidently, then, the two tests are not sufficient 
to define two common factors or abilities. The same type of reasoning can 
be extended to more tests and to higher dimensions. 
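The two-test example can be made concrete with a short sketch. The intercorrelation of .40 and the three pairs of diagonal values are hypothetical; `configuration` is an illustrative helper that returns the two vector lengths and the angle between them implied by a chosen pair of communalities.

```python
import math


def configuration(r12, h1_sq, h2_sq):
    """Lengths of two test vectors and their separation in degrees.

    The scalar product must equal r12, so cos(phi) = r12 / (h1 * h2).
    """
    h1, h2 = math.sqrt(h1_sq), math.sqrt(h2_sq)
    cos_phi = r12 / (h1 * h2)
    if abs(cos_phi) >= 1.0:
        raise ValueError("these diagonal values do not leave the rank at 2")
    return h1, h2, math.degrees(math.acos(cos_phi))


r12 = 0.40  # hypothetical observed intercorrelation of the two tests
for h1_sq, h2_sq in [(1.0, 1.0), (0.8, 0.5), (0.5, 0.5)]:
    h1, h2, phi = configuration(r12, h1_sq, h2_sq)
    print(f"h1^2={h1_sq:.1f}  h2^2={h2_sq:.1f}  "
          f"lengths=({h1:.3f}, {h2:.3f})  angle={phi:.1f} degrees")
```

Each admissible pair of diagonal values gives a different pair of lengths and a different angle, yet all of them reproduce the single observed coefficient, so the one intercorrelation cannot fix the configuration.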

The relation between the number of tests n and the number of independent
factors r is subject to a limitation with regard to the present scientific
problem. The number of reference abilities in n tests must satisfy one of the
three following possibilities, namely, r > n, r = n, or r < n. By the factor
theorem (1) it is seen that n tests will produce a correlational matrix whose
rank will not exceed its order n. If, then, r > n, the factors cannot be
isolated by factorial methods. If more than n factors are involved, it is
necessary to augment the test battery with additional tests before the
reference factors can be isolated. If r = n, there are as many factors as there
are tests. Such a solution is always possible, and it is therefore trivial as
far as the scientific problem is concerned. The solution in which r = n
violates the fundamental postulate of science that every valid hypothesis is
overdetermined by the data. This case is discussed further in chapter iv on
"The Principal Axes." The only allowable case is that in which r < n. This
leads to the following postulate.

Postulate. The number of reference abilities in a test battery is less than the 
number of tests. 

This condition must be satisfied, or the reference abilities cannot be iso- 
lated by factorial methods. In setting up a test battery for the purpose of 
discovering the primary abilities, the experimenter must so select the tests 
that the number of primary abilities is smaller than the number of tests in 
the battery. A more exact relation between r and n which must be satisfied 
in order that a unique solution shall exist will now be shown. 

The number of intercorrelations in R which are to determine the
configuration is

$\frac{n(n-1)}{2}$ .

These intercorrelations constitute the observations. The number of
parameters in F is nr, but this number can be reduced. If the first co-ordinate
axis is passed through the first test, then

$a_{12} = a_{13} = \cdots = a_{1r} = 0$ .

The second orthogonal axis may be so placed that test 2 lies in the I-II
plane. Then

$a_{23} = a_{24} = \cdots = a_{2r} = 0$ .

This process can be continued until there are one or more zero co-ordinates
for each of the first (r-1) tests. The factorial matrix will then appear like
Table 1, which has been arranged so as to represent n tests and r factors.

Table 1

          r common factors

a_11    0       0       ...    0
a_21    a_22    0       ...    0
a_31    a_32    a_33    ...    0
 .       .       .              .
a_n1    a_n2    a_n3    ...    a_nr

The number of parameters in F then becomes

$nr - \tfrac{1}{2} r(r-1)$ .

In order that there shall be a unique solution, the number of experimentally
independent values in $R_0$ must equal or exceed the number of linearly
independent* parameters in F. Hence

(2) $nr - \frac{r(r-1)}{2} \leq \frac{n(n-1)}{2}$ .

The condition for a maximum value of r for a given value of n is represented
by substituting an equality sign for the inequality. The condition then
becomes

(3) $nr - \frac{r(r-1)}{2} = \frac{n(n-1)}{2}$

or

(4) $2nr - r(r-1) - n(n-1) = 0$ .

Solving the quadratic in r, we have the following theorem.

Theorem 8. In order that the correlational matrix R with unknown diagonals
for n tests and r common factors shall represent a unique configuration,
it is in general necessary that

(5) $r \leq \frac{(2n+1) - \sqrt{8n+1}}{2}$ .

The suppression of the positive sign before the radical in (5) is justified by
the postulate that r<n. When the equality sign is used in (5), the value
of r becomes integral for certain values of n. Then the number of independent
parameters of F is exactly equal to the number of experimentally independent
coefficients in $R_0$. Such is the case when n=6 and r=3.
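The step from (4) to (5) can be written out as a short worked derivation. The numerical check for n = 6 in the comments is added here for illustration and is not part of the original text.

```latex
\begin{align*}
2nr - r(r-1) - n(n-1) &= 0 \\
r^2 - (2n+1)\,r + n(n-1) &= 0 \\
r &= \frac{(2n+1) \pm \sqrt{(2n+1)^2 - 4n(n-1)}}{2}
   = \frac{(2n+1) \pm \sqrt{8n+1}}{2}
\end{align*}
% Check for n = 6: \sqrt{8n+1} = \sqrt{49} = 7, so r = (13-7)/2 = 3 or r = (13+7)/2 = 10.
% The postulate r < n discards r = 10.
% Parameter count for n = 6, r = 3: nr - r(r-1)/2 = 18 - 3 = 15 = n(n-1)/2.
```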

* In mathematical and scientific use the term independence has several different mean- 
ings. The context usually indicates clearly enough which of several meanings is implied. 
It may be useful to enumerate three of these meanings. Linear independence is here used 
in the sense in which the term is defined in current mathematical textbooks. The term 
statistical independence is here used to mean zero correlation, i.e., the case in which cross 
products of two variables vanish. Its geometrical representation is the orthogonality of 
a pair of vectors. Several values are here said to be experimentally independent if they 
have been separately determined in experimentation. 






Since (4) is symmetric in n and r, n can be expressed explicitly in terms of
r by analogy from (5), so that

(6) $n \geq \frac{(2r+1) + \sqrt{8r+1}}{2}$ .



This relation shows the minimum number of tests required for the deter- 
mination of r factors. Formula (6) shows, for example, that there must be at 
least eight tests in order to determine four factors. 

It is useful to have a table to show the smallest number of tests that will 
just determine a given number of factors or the largest number of factors 
that can just be determined by a given number of tests. This information is 
summarized for ten factors in Table 2. 

Table 2

No. of Factors    No. of Tests      No. of Factors    No. of Tests
      r                n                  r                n

      1                3*                 6               10*
      2                5                  7               12
      3                6*                 8               13
      4                8                  9               14
      5                9                 10               15*

* The asterisks refer to integral values of both r and n in (6).
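The entries of Table 2 can be regenerated by counting parameters directly rather than evaluating the radical in (6). The sketch below is illustrative; the function names are hypothetical, and integer arithmetic is used (condition (2) multiplied through by 2) to avoid rounding questions.

```python
def is_determinate(n, r):
    """Condition (2), doubled: 2nr - r(r-1) <= n(n-1)."""
    return 2 * n * r - r * (r - 1) <= n * (n - 1)


def is_exact(n, r):
    """True when the parameters of F exactly equal the coefficients in R_0."""
    return 2 * n * r - r * (r - 1) == n * (n - 1)


def min_tests(r):
    """Smallest number of tests n > r that satisfies condition (2)."""
    n = r + 1
    while not is_determinate(n, r):
        n += 1
    return n


print(" r   n")
for r in range(1, 11):
    n = min_tests(r)
    mark = "*" if is_exact(n, r) else ""
    print(f"{r:2d}  {n:2d}{mark}")
```

The starred rows are those in which the count of parameters and the count of intercorrelations coincide, so that the equality in (6) holds for integral r and n.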

The case of n tests and n factors 

There is a simple solution which is satisfactory as long as the factor prob- 
lem is regarded only in its mathematical aspects but which is fictitious as a 
solution to the present psychological problem. Since this simple solution 
with as many factors as there are tests is certain to occur to anyone who 
studies the factor problem, some discussion of its limitations is in order even 
though it can be shown to be psychologically trivial. 

In this solution each test is represented by a radial unit test vector in
space of n dimensions. Since the scalars are all unity, the angular
separations between the vectors must be adjusted in order that the correlations
shall represent scalar products of these vectors. In the correlational matrix
there are n(n-1)/2 experimentally independent correlation coefficients
where unity is written in each diagonal cell. In the factorial matrix with as
many factors as there are tests the number of independent parameters is
also n(n-1)/2, since the factorial matrix is normalized by rows.
Consequently, it may be expected that an exact solution exists in the form of a
square matrix F of order n and rank n which reproduces exactly the
experimentally obtained correlation coefficients in $R_0$.

The fallacious character of the solution in which there are as many factors
as there are tests can be seen by considering the fact that it assumes as many
degrees of freedom in the hypothesis F as there are independent experimental
observations in $R_0$. This violates the postulate of science that a
valid hypothesis is overdetermined by the data. Hence the solution is
scientifically trivial even though it is mathematically valid.

That the description of n tests by as many factors is an erroneous solution
can be seen as well from other considerations. If the number of postulated
common factors is equal to the number of tests, then it is possible to
account for the intercorrelations of $R_0$ exactly by the n common factors. But
the experimentally obtained correlations in $R_0$ contain the effects of at least
three sources of variance which are known to be unique for each test. These
are (a) the variable chance errors in the scores of the N individuals, (b) the
specific factors or abilities which are almost certain to be involved in each
test of any finite battery, and (c) the sampling errors in the coefficients of
$R_0$. All three of these sources of variance are unique for each test; and hence
they must be accounted for by unique factors, i.e., factors which are, by 
definition, not common factors. But the solution in which n common factors 
account exactly for the n tests leaves no part of the variance to the unique 
factors that are known to exist. Hence such a solution can be discarded by 
psychological considerations apart from mathematical reasoning. The rea- 
son why these considerations are not immediately evident in dealing with 
the factor problem is that the existence of the three sources of unique vari- 
ance in the n tests is a scientific fact quite extraneous to the correlational 
matrix $R_0$. In other words, more is known about the tests than is given in
the correlational matrix. This additional information, which is not given by 
the intercorrelations as such, is our knowledge that each test is influenced by 
factors that are unique and not common. Although it seems evident from 
scientific, as well as psychological, considerations that the case of n com- 
mon factors for n tests is trivial, there is some interest in knowing that such 
a solution can be written quite readily for any correlation table. 

A method of factoring any symmetric matrix* 

The solution to be described is a simple general method of factoring any
symmetric matrix. It will be called the diagonal method. Let Table 3
represent a correlational matrix R, and let Table 4 represent a factorial
matrix F of order n×r in which r is the rank of F and the rank of R. It will be
assumed that F has been rotated as described in a preceding section (Table 1)
so as to minimize the number of independent parameters.

* "Theory of Multiple Factors," p. 13.

Table 3

r_11    r_12    r_13    ...    r_1n
r_12    r_22    r_23    ...    r_2n
r_13    r_23    r_33    ...    r_3n
 .       .       .              .
r_1n    r_2n    r_3n    ...    r_nn

Table 4

          r common factors

a_11
a_21    a_22
a_31    a_32    a_33
 .       .       .
a_n1    a_n2    a_n3    ...    a_nr

By the factor theorem (1),

(7) $r_{11} = a_{11}^2$ .

If the diagonal self-correlations are known, then $a_{11}$ is known. If the
self-correlations are unknown, then $r_{11}$ may be set equal to unity, in which
case $a_{11}$ is also unity.
The correlation

(8) $r_{k1} = a_{11} a_{k1}$ ,

and hence

(9) $a_{k1} = \frac{r_{k1}}{a_{11}}$ ,

so that the entries in the first column of F can be determined.

The correlation

(10) $r_{22} = a_{21}^2 + a_{22}^2$ ,

so that

(11) $a_{22}^2 = r_{22} - a_{21}^2$ .

Here, as before, a given diagonal entry may be used; but if the diagonal
entry is unknown, it can be given an arbitrary value of unity, which means
that F shall represent the total variance of each test.
The correlation

(12) $r_{k2} = a_{21} a_{k1} + a_{22} a_{k2}$ ,

so that

(13) $a_{k2} = \frac{r_{k2} - a_{21} a_{k1}}{a_{22}}$ ,

and hence the second column of F can be determined.
The correlation

(14) $r_{33} = a_{31}^2 + a_{32}^2 + a_{33}^2$ ,

so that

(15) $a_{33}^2 = r_{33} - a_{31}^2 - a_{32}^2$ .

The correlation

(16) $r_{k3} = a_{31} a_{k1} + a_{32} a_{k2} + a_{33} a_{k3}$ ,

so that

(17) $a_{k3} = \frac{r_{k3} - a_{31} a_{k1} - a_{32} a_{k2}}{a_{33}}$ ,

and hence the third column of F can be determined.

If R is of rank r, there will be r columns of F. If this procedure is con- 
tinued to column (r+1), it will be found that the entries in such a column 
all vanish. It will be seen by equations of the type (8), (12), (16), that each 
of the coefficients in R determines a parameter in F if r=n. 
This method illustrates the following theorems. 

Theorem 9. Any symmetric matrix A of order n×n and of rank r can be
factored into the matrix B and its transpose B' where B is a matrix of
order n×r and of rank r.

Theorem 10. Any symmetric matrix A of order n×n and of rank r in
which all but r of its diagonal entries are unknown can be factored into
an n×r matrix B and its transpose B' where B is a matrix of order
n×r and of rank r.
A simple numerical example is given in Table 5 which shows the
intercorrelations of four fictitious tests with unity in the diagonal cells.
The rank is 4. The corresponding factorial matrix is shown in Table 6. In
Table 7 the same intercorrelations are reproduced with communalities in the
diagonal cells by which the rank of the matrix is reduced to 2. Corresponding
factorial matrices are shown in Table 8.



Table 5 

Fictitious Correlational Matrix 
I II III IV 

1.00 + .56 + .24 - .61 



+ .56 1.00 - .12 - .63 



+ .24 - .12 1.00 - .18 



4 _ .61 - .63 - .18 1.00 



Table 6 
Factorial Matrix Which Reproduces the Arbitrary 

Symmetric Matrix of Table 5 
I II III IV 



+1.000 000 















+ .560 000 +.828 493 



+ .240 000 -.307 064 +,920 930 



-.610000 -.348102 -.152552 +.695308 
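
The column-by-column procedure of equations (8) to (17) can be sketched in a few lines. The following is a minimal illustration, not the author's computing routine: the function name `diagonal_factor` and the tolerance `tol` are assumptions introduced here, and the early exit on a vanishing diagonal term is a simplification that presumes the independent tests come first.

```python
import math

def diagonal_factor(R, tol=1e-9):
    """Factor a symmetric matrix R into a lower-triangular F with F F' = R.

    Column m follows equations (8)-(17): the diagonal entry fixes a_mm, and
    each entry below it is a residual correlation divided by a_mm.
    """
    n = len(R)
    F = [[0.0] * n for _ in range(n)]
    for m in range(n):
        # a_mm^2 = r_mm minus the squares already placed in row m
        d = R[m][m] - sum(F[m][p] ** 2 for p in range(m))
        if d <= tol:
            break  # simplification: a vanishing pivot is read as "rank reached"
        F[m][m] = math.sqrt(d)
        for k in range(m + 1, n):
            cross = sum(F[m][p] * F[k][p] for p in range(m))
            F[k][m] = (R[k][m] - cross) / F[m][m]
    return F

# Table 5: unity in the diagonal cells, rank 4
TABLE_5 = [
    [ 1.00,  0.56,  0.24, -0.61],
    [ 0.56,  1.00, -0.12, -0.63],
    [ 0.24, -0.12,  1.00, -0.18],
    [-0.61, -0.63, -0.18,  1.00],
]

F = diagonal_factor(TABLE_5)
for row in F:
    print("  ".join(f"{x:+.6f}" for x in row))
```

The printed rows may be compared with Table 6; the products of pairs of rows of F return the entries of Table 5.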



The rank of a matrix

Since the number of linearly independent factors has been shown to be
the rank of the correlational matrix, it is of some interest to investigate the
possible means for determining the rank of a matrix. The rank is defined as
the highest order of the non-vanishing minors, but to expand all of the
minors even of a specified order is a prohibitive task when n is large. For the
scientific problem it is not of much value to have methods of determining
the rank, because the rank of a correlational matrix $R_0$ with experimentally
obtained coefficients is known to be equal to its order. This is evident
because sampling errors and chance errors in the scores are fortuitous
components in the coefficients.

Table 7
The Same Correlational Matrix as in
Table 5 except for Communalities in
Diagonal Cells Which Reduce the Rank

          I        II       III      IV
1      + .58    + .56    + .24    - .61
2      + .56    + .74    - .12    - .63
3      + .24    - .12    + .72    - .18
4      - .61    - .63    - .18    + .65

Table 8
Factorial Matrices Which Reproduce the Symmetric
Matrix of Table 7

          I       II             I            II
1      + .70   + .30        + .761577     .000000
2      + .50   + .70        + .735316   + .446442
3      + .60   - .60        + .315135   - .787839
4      - .70   - .40        - .800969   - .091915
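
That two different factorial matrices reproduce the same reduced correlational matrix can be checked directly. The sketch below multiplies each of the two matrices of Table 8 by its transpose and reports the largest discrepancy from Table 7; the helper names `reproduce` and `max_abs_diff` are illustrative assumptions, not part of the text.

```python
def reproduce(F):
    """Return F F' as a list of rows (scalar products of pairs of rows of F)."""
    return [[sum(a * b for a, b in zip(fj, fk)) for fk in F] for fj in F]

def max_abs_diff(A, B):
    """Largest absolute difference between corresponding cells of A and B."""
    return max(abs(a - b) for ra, rb in zip(A, B) for a, b in zip(ra, rb))

# Table 7: communalities in the diagonal cells, rank 2
TABLE_7 = [
    [ 0.58,  0.56,  0.24, -0.61],
    [ 0.56,  0.74, -0.12, -0.63],
    [ 0.24, -0.12,  0.72, -0.18],
    [-0.61, -0.63, -0.18,  0.65],
]

# Table 8, first matrix
F_A = [[0.70, 0.30], [0.50, 0.70], [0.60, -0.60], [-0.70, -0.40]]
# Table 8, second matrix (six-decimal entries, so agreement is approximate)
F_B = [[0.761577, 0.000000], [0.735316, 0.446442],
       [0.315135, -0.787839], [-0.800969, -0.091915]]

print(max_abs_diff(reproduce(F_A), TABLE_7))
print(max_abs_diff(reproduce(F_B), TABLE_7))
```

Both discrepancies are at the level of rounding, which illustrates that the factoring is not unique until the reference axes are fixed.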

The theorem to be described here is useful for estimating the rank of a
matrix when the cell entries can be assumed to be free from experimental
errors. It may be useful in estimating the rank of $R_0$ containing
experimental coefficients when n is large, but it is not likely to be useful
when n is as small as 10 or 15. The theorem is relevant to the factor problem,
and some useful adaptation of it may be made to fallible data.

Theorem 11. If any matrix of rank r is sectioned into a composite square
matrix of order s where s>r, then the determinant of the composite
matrix vanishes.

Table 9

    14    12  |   6     8  |   2
   -----------+------------+-----
     6   104  |  21     9  |  17
   -----------+------------+-----
     7     6  |   3     4  |   1
    35    30  |  15    20  |   5


The matrix will be said to be sectioned when the columns have been
divided into s groups, and when the rows have also been divided into the same
number of groups. Let r=2 as an example. Since s>r, we may let s=3.
Then the n columns of R will be divided into three groups of p, (q-p), and
(n-q) columns, respectively; while the n rows of R will be divided into
three groups of t, (u-t), and (n-u) rows, respectively. The matrix R will
then be sectioned.

Table 10

     26    14     2
    110    30    17
     78    42     6


The composite matrix will be defined as the square matrix of order s in
which the entries are the sums of the elements in the corresponding parts of
the sectioned matrix. The example of Table 9 illustrates the formation of a
composite matrix. This 4×5 matrix is of rank 2. It has been sectioned into
a 3×3 square matrix by arbitrarily dividing the columns into three groups
of 2, 2, and 1 columns, respectively, and by arbitrarily dividing the rows
into three groups of 1, 1, and 2 rows, respectively. The composite matrix is
shown in Table 10. Its determinant vanishes.
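
The formation of the composite matrix can be sketched as follows. The function names `composite` and `det3` are assumptions made for this illustration; the data are the entries of Table 9 and the grouping is the one described above.

```python
def composite(M, row_sizes, col_sizes):
    """Sum the entries of M within each block of the sectioned matrix."""
    assert sum(row_sizes) == len(M) and sum(col_sizes) == len(M[0])
    out, r0 = [], 0
    for rs in row_sizes:
        row, c0 = [], 0
        for cs in col_sizes:
            row.append(sum(M[i][j]
                           for i in range(r0, r0 + rs)
                           for j in range(c0, c0 + cs)))
            c0 += cs
        out.append(row)
        r0 += rs
    return out

def det3(A):
    """Determinant of a 3x3 matrix by expansion along the first row."""
    (a, b, c), (d, e, f), (g, h, i) = A
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

TABLE_9 = [
    [14,  12,  6,  8,  2],
    [ 6, 104, 21,  9, 17],
    [ 7,   6,  3,  4,  1],
    [35,  30, 15, 20,  5],
]

# rows in groups of 1, 1, 2; columns in groups of 2, 2, 1
table_10 = composite(TABLE_9, row_sizes=[1, 1, 2], col_sizes=[2, 2, 1])
print(table_10)        # [[26, 14, 2], [110, 30, 17], [78, 42, 6]]
print(det3(table_10))  # 0
```

Since the entries are integers, the vanishing of the determinant is exact here and not a matter of rounding.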

The proof of the theorem will be written for rank 2, but it can readily be
generalized for any rank. If R is of rank 2, it is possible to find two rows that
are linearly independent. Let these be the first and second rows. Then the
elements of the jth row can be expressed as a linear function of the first two
rows so that

(18)   $r_{jk} = m_1 r_{1k} + m_2 r_{2k}$ .

It is evident that the sum of the first p entries of row j can also be expressed
as the same linear function of the corresponding sums in the first two rows.
We have then

(19)   $\sum_{k=1}^{p} r_{jk} = m_1 \sum_{k=1}^{p} r_{1k} + m_2 \sum_{k=1}^{p} r_{2k}$ .

Similar summations may be written for the other two groups of columns so
that

(20)   $\sum_{k=p+1}^{q} r_{jk} = m_1 \sum_{k=p+1}^{q} r_{1k} + m_2 \sum_{k=p+1}^{q} r_{2k}$ ,

(21)   $\sum_{k=q+1}^{n} r_{jk} = m_1 \sum_{k=q+1}^{n} r_{1k} + m_2 \sum_{k=q+1}^{n} r_{2k}$ .

These summations may be represented in an n×3 matrix as shown in
Table 11. Since each of the rows can be expressed as a linear function of the
first two rows, it follows that the rank of this n×3 matrix is also 2. The
columns may be so arranged that the third column of this matrix may be
expressed in terms of the first two columns. This reduction by columns is
similar to the reduction by rows that has been described. This reduction
by columns gives a 3×3 composite matrix whose rank is 2, and hence its
determinant vanishes. If the rank of R is equal to or greater than the order s
of the square composite matrix, then the determinant of the composite does
not necessarily vanish.

This theorem and other considerations about the rank of a matrix are of
analytical interest because of the fact that the rank has been shown to be
equal to the number of linearly independent common factors which are
necessary to account for the intercorrelations. It does not seem to be feasible
to apply this theorem directly to the determination of the communalities
because of the sampling errors in the coefficients. It is possible that the
theorem can be applied with profit to a large matrix whose rank is only a
fraction of its order.

Table 11

$$
\begin{array}{c|ccc}
1 & \sum_{k=1}^{p} r_{1k} & \sum_{k=p+1}^{q} r_{1k} & \sum_{k=q+1}^{n} r_{1k} \\
2 & \sum_{k=1}^{p} r_{2k} & \sum_{k=p+1}^{q} r_{2k} & \sum_{k=q+1}^{n} r_{2k} \\
\vdots & \vdots & \vdots & \vdots \\
n & \sum_{k=1}^{p} r_{nk} & \sum_{k=p+1}^{q} r_{nk} & \sum_{k=q+1}^{n} r_{nk}
\end{array}
$$

Methods of estimating communalities

Before the correlational matrix R can be factored into the matrices F and
F' which constitute the solution, it is necessary to compute or to estimate
the communalities. If the cell entries of the correlational matrix are
infallible, the computation of the communalities is a relatively simple matter;
but if the coefficients are experimentally obtained values, the communalities
can be at best only estimated. Fortunately, the estimates of the
communalities need not be at all close when the number of tests or variables
is large. When the number of tests is as small as ten or twelve, it becomes
essential to ascertain the communalities with some degree of exactness.

In this section several methods of computing or estimating the commu- 
nalities will be described. Most of these methods are not suitable for pur- 
poses of computation, partly because of the limitation that experimental 
data are affected by sampling errors and partly because some of the methods 
are prohibitive in arithmetical labor. Those who study the factor problem 
analytically will find these methods of some interest. 

One of the simplest of these methods is used as a first estimate for the
centroid method, which is described in the next chapter. By successive
approximations the communalities may be determined to any required degree
of exactness.

1. Expansion of a minor of order (r+1)

If the correlation coefficients are infallible, a simple procedure for
computing the communalities is as follows: In order to compute the
communality of a test j, select any minor in the correlational matrix which
contains the diagonal entry for test j but no other diagonal entries, and
which is of order greater than the rank. By definition of the rank of a matrix
this minor must vanish. Its expansion is a linear equation in one unknown by
which the communality may be computed. It is evident that this simple method
is not applicable to fallible data, and consequently the method is not
practically useful. It is possible that this method may be generalized into a
useful summation formula.

Table 12

        1     2     3     4     5     6     7     8
1            .56   .16   .24   .72   .64   .40   .24
2     .56          .38   .49   .67   .72   .63   .53
3     .16   .38          .48   .24   .40   .52   .54
4     .24   .49   .48          .34   .52   .64   .65
5     .72   .67   .24   .34          .76   .52   .35
6     .64   .72   .40   .52   .76          .68   .56
7     .40   .63   .52   .64   .52   .68          .71
8     .24   .53   .54   .65   .35   .56   .71

In Table 12 are reproduced the intercorrelations of eight hypothetical
variables. The rank of the matrix is 2. Table 13 shows a minor of order 3
with one unknown entry, namely, the communality for variable No. 1. In
order that the expansion of the determinant of Table 13 shall vanish, the
unknown diagonal entry must be .64. If the rank is unknown and if it is
assumed too high (say 3), it will be found that the coefficients of $h_1^2$, as
well as the numerical terms, all vanish. This indeterminacy can be removed by
assuming a lower rank. An exception is the case in which the minor of Table
13 is of rank 2 when some other minor in Table 12 of order 3 or higher does
not vanish. Such a situation would be discovered routinely by the centroid
method, so that it is not necessary to evaluate all possible minors of order 3
in Table 12.
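
Because the determinant of the minor is linear in the single unknown diagonal entry, two evaluations of the determinant are enough to locate the value at which it vanishes. The sketch below applies this to the minor of order 3 formed from rows 1, 4, 5 and columns 1, 2, 3 of Table 12; the function names are assumptions introduced for the illustration.

```python
def det3(A):
    """Determinant of a 3x3 matrix by expansion along the first row."""
    (a, b, c), (d, e, f), (g, h, i) = A
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

def communality_from_minor(minor, i, j):
    """Solve det(minor) = 0 for the single unknown entry in cell (i, j)."""
    def det_with(x):
        M = [row[:] for row in minor]
        M[i][j] = x
        return det3(M)
    d0 = det_with(0.0)
    slope = det_with(1.0) - d0          # coefficient of the unknown
    if abs(slope) < 1e-12:
        # the case described above: the assumed rank is too high
        raise ValueError("indeterminate: coefficient of the unknown vanishes")
    return -d0 / slope

# rows 1, 4, 5 and columns 1, 2, 3 of Table 12; None marks the unknown
MINOR = [
    [None, 0.56, 0.16],
    [0.24, 0.49, 0.48],
    [0.72, 0.67, 0.24],
]

h1_sq = communality_from_minor(MINOR, 0, 0)
print(round(h1_sq, 6))  # 0.64
```

The expansion is -.204 times the unknown plus .13056, which vanishes when the unknown is .64, in agreement with the value stated above.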

Table 13

         1         2      3
1     $h_1^2$    .56    .16
4      .24       .49    .48
5      .72       .67    .24

2. Grouping of similar tests

If the test battery is large enough so that each test belongs in a
constellation of similar tests, then the tests in each constellation will be
represented by vectors in the common-factor space with relatively small
angular separations. The communality of a test is the square of the length of
its vector. If the angular separations between several test vectors are
relatively small, then the projection of a test vector on the centroid vector
of the constellation will be nearly the same as the length of the vector. The
square of the projection may be used as an estimate of the communality of the
test with the knowledge that the estimate will be slightly too low. The
projection of each test vector on the best fitting single vector for the
constellation is essentially the same as the loading of the test with the
single common factor which best describes the intercorrelations of the tests
in the constellation. Relatively simple methods for dealing with the special
case of rank one are described in chapter v.

3. Grouping of three tests 

A special case of the preceding method is that of using only three tests in 
a constellation. Since the intercorrelations of three tests can always be ac- 
counted for exactly by a single common factor, this method does not con- 
tain any check of internal consistency. To obtain such a check for a single 
common factor requires at least four tests. This is Spearman's problem, 
which is discussed in chapter v. 

One procedure for estimating the communality of a test j is to select the
two other tests which have the highest correlations with test j. Let these two
additional tests be k and l. If the test battery is so constructed that each
postulated ability is represented by several tests, it can be expected that the
three tests j, k, and l will be represented by test vectors with relatively
small angular separations. If this condition is satisfied, the three vectors can
be represented approximately by their projections on a common centroid
vector, so that the intercorrelations are nearly accounted for by a single
factor common to the three tests. We have then

(22)   $r_{jk} = a_j a_k$ ,

(23)   $r_{jl} = a_j a_l$ ,

(24)   $r_{kl} = a_k a_l$ ,

so that

(25)   $\dfrac{a_j}{a_k} = \dfrac{r_{jl}}{r_{kl}}$ ,

or

(26)   $a_k = a_j \dfrac{r_{kl}}{r_{jl}}$ .

But

(27)   $r_{jk} = a_j a_k$ ,

and hence

(28)   $r_{jk} = a_j^{2}\, \dfrac{r_{kl}}{r_{jl}}$ ,

so that

(29)   $a_j = \sqrt{\dfrac{r_{jk}\, r_{jl}}{r_{kl}}}$ ,

where tests k and l are selected because of high correlations with j.

This formula is familiar. In fact, it is Spearman's* formula for the
correlation of a test with the common factor g, but it is here used under quite
different circumstances and with different assumptions. Spearman uses this
formula to ascertain the correlation of a test with the central intellective
factor under the assumption that only one principal factor is operative.
Here two tests are selected because they correlate highest with test j under
the assumption that the intercorrelations of these three tests may be
described in terms of a single common factor, but it is also assumed that there
are different common factors for different sets of three tests that may be
selected in the battery. It is not assumed that the common factor is the
same for all combinations of three tests. The formula is used here merely to
estimate the communality of each test.

* The Abilities of Man (Macmillan Co., 1927), Appendix, eq. (19).

The diagonal entry for test j is then

(30)   $h_j^{2} = \dfrac{r_{jk}\, r_{jl}}{r_{kl}}$ ,

where tests k and l are the two tests that correlate highest with j. This
procedure is continued in estimating the diagonal entry for each of the n
columns. In general, these values should be slightly too low.
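
Equation (30) lends itself to a short routine. The sketch below applies it to every column of Table 12; the names `symmetric_from_upper` and `triad_estimate` are assumptions of this illustration, and "highest" is taken here to mean the largest signed coefficient, which is adequate for a table without negative entries.

```python
# Table 12, entries above the diagonal, row by row
UPPER = [
    [.56, .16, .24, .72, .64, .40, .24],
    [.38, .49, .67, .72, .63, .53],
    [.48, .24, .40, .52, .54],
    [.34, .52, .64, .65],
    [.76, .52, .35],
    [.68, .56],
    [.71],
]

def symmetric_from_upper(upper):
    """Build the full table; diagonal cells stay None (unknown)."""
    n = len(upper) + 1
    R = [[None] * n for _ in range(n)]
    for i, row in enumerate(upper):
        for offset, value in enumerate(row, start=1):
            R[i][i + offset] = R[i + offset][i] = value
    return R

def triad_estimate(R, j):
    """Equation (30): h_j^2 = r_jk r_jl / r_kl for the two tests k, l
    that correlate highest with test j."""
    others = sorted((i for i in range(len(R)) if i != j),
                    key=lambda i: R[j][i], reverse=True)
    k, l = others[:2]
    return R[j][k] * R[j][l] / R[k][l]

R = symmetric_from_upper(UPPER)
estimates = [triad_estimate(R, j) for j in range(len(R))]
print([round(h, 3) for h in estimates])
```

For variable 1 the two highest coefficients are .72 and .64, with tests 5 and 6, whose intercorrelation is .76; the estimate is therefore (.72)(.64)/.76, somewhat below the exact value of .64 found by expansion of a minor.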

If as many as four tests of each kind have been included in the battery, 
then an estimate of the communality of each of them may be taken as the 
average of four sets of three tests. 

One useful circumstance is that the estimate of the communality is of
significance only when the number of variables is relatively small, say eight
or ten. When the number of variables is as large as thirty or forty, any
value between 0 and +1 may be recorded in the diagonal cell of each column
without affecting noticeably the resulting factor loadings as determined by
the centroid method. The reason for this is that the diagonal entry has a
very slight effect on the relative order of magnitude of the sum of a column.

In selecting the tests k and l which are to be used for estimating the
communality of j, it is probably best first to correct for attenuation. Then the
two highest correlations in each column indicate which two tests to select
for each column. The communalities are determined by equation (30), in
which raw coefficients are used. The correction for attenuation may be used
only to ascertain which tests are to be selected in each column, although this
refinement is probably not essential.

4. Highest coefficient in each column 

Inspection of equation (30) for estimating the communality of a test
suggests a further simplification in the estimate. The numerator contains the
product of the two highest correlations in the column for test j. The
denominator is the intercorrelation of the two tests so selected, namely, k and
l. If these coefficients are of the same order of magnitude, then the
estimated communality of test j will be nearly equal to the highest
intercorrelation in column j. This is the method that has been found in
practice to give consistently better results than any of the many other much
more elaborate methods that have been tried so far. This method is used as a
first approximation in the centroid method of extracting the test coefficients.
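
This estimate requires nothing more than a scan of each column. A minimal sketch for Table 12 follows; the function name `highest_in_column` is an assumption of this illustration, and the signed maximum is used because the table contains no negative coefficients.

```python
# Table 12, entries above the diagonal, row by row
UPPER = [
    [.56, .16, .24, .72, .64, .40, .24],
    [.38, .49, .67, .72, .63, .53],
    [.48, .24, .40, .52, .54],
    [.34, .52, .64, .65],
    [.76, .52, .35],
    [.68, .56],
    [.71],
]

n = len(UPPER) + 1
R = [[None] * n for _ in range(n)]          # diagonal cells stay unknown
for i, row in enumerate(UPPER):
    for offset, value in enumerate(row, start=1):
        R[i][i + offset] = R[i + offset][i] = value

def highest_in_column(R, j):
    """First estimate of the communality of test j: the highest
    coefficient in column j, the diagonal cell being omitted."""
    return max(R[i][j] for i in range(len(R)) if i != j)

diagonal_estimates = [highest_in_column(R, j) for j in range(n)]
print(diagonal_estimates)
# [0.72, 0.72, 0.54, 0.65, 0.76, 0.76, 0.71, 0.71]
```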

5. Linear dependence of rows or columns 

If the rank of a correlational matrix is r, then any row may be expressed
linearly in terms of any r independent rows. It may be possible to
generalize this principle into a method of computing the communalities for
fallible data.

Since the rank of the correlational matrix of Table 12 is 2, any row can be
expressed as a linear function of any two independent rows. Table 13 shows
a third-order minor of Table 12 in which the first row can be expressed as a
linear function of the second and third rows. The two multipliers may be
determined from columns 2 and 3 by equations of the type (18). When these
are known, the communality may be computed.

Another example will be shown with reference to Table 12. Assume that
the second and third rows are independent. Consider a 3×6 matrix
consisting of the first three rows of Table 12 and all of its columns except
2 and 3. This matrix may be assumed to be also of rank 2; and it contains only
one unknown entry, namely, the communality of the first variable. If the
first row is expressed linearly in terms of the second and third rows, the two
multiplying coefficients may be determined from any pair of independent
columns. When these are known, the unknown communality may be
computed.

Table 14

         1         2      3      4
1     $h_1^2$    .56    .16    .24
5      .72       .67    .24    .34
6      .64       .72    .40    .52
7      .40       .63    .52    .64
8      .24       .53    .54    .65
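
The 3×6 example described above can be worked as follows. The two multipliers are found from columns 4 and 5 and then applied to column 1; the helper `solve2` and the variable names are assumptions of this sketch, and any other pair of independent columns could have been used.

```python
def solve2(a11, a12, a21, a22, b1, b2):
    """Solve a11*x + a12*y = b1, a21*x + a22*y = b2 by Cramer's rule."""
    det = a11 * a22 - a12 * a21
    return (b1 * a22 - a12 * b2) / det, (a11 * b2 - b1 * a21) / det

# first three rows of Table 12, columns 1, 4, 5, 6, 7, 8
ROW_1 = [None, .24, .72, .64, .40, .24]   # None is the unknown communality
ROW_2 = [.56, .49, .67, .72, .63, .53]
ROW_3 = [.16, .48, .24, .40, .52, .54]

# multipliers m1, m2 with row 1 = m1 * row 2 + m2 * row 3,
# determined from columns 4 and 5 (positions 1 and 2 of the lists)
m1, m2 = solve2(ROW_2[1], ROW_3[1],
                ROW_2[2], ROW_3[2],
                ROW_1[1], ROW_1[2])

h1_sq = m1 * ROW_2[0] + m2 * ROW_3[0]
print(round(m1, 5), round(m2, 5), round(h1_sq, 6))

# the same multipliers should reproduce the remaining known entries of row 1
check = [m1 * ROW_2[c] + m2 * ROW_3[c] for c in range(3, 6)]
print([round(x, 6) for x in check])
```

The communality again comes out as .64, and the last line returns the entries .64, .40, and .24 of the first row, which is the check of internal consistency available when the data are infallible.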

6. Sectioning of the matrix 

A correlational matrix is square, and it may be divided into four
quadrants in such a way that all of the unknown diagonal entries lie in the
upper left and the lower right quadrants. All of the entries in the upper right
and lower left quadrants are known. These two quadrants are symmetric about
the diagonal. A part of the matrix of Table 12 may be sectioned, as shown
in Table 14. A composite matrix of rank 2 may be formed as shown in Table
15, in which the first row can be expressed as a linear function of the second
and third rows. The two multiplying constants may be determined from
the second and third columns of Table 15. When these multipliers are
known, the communality of the first variable may be computed. The same
procedure can be repeated with the second and with each succeeding row of
Table 12. In this manner all of the communalities in the upper left quadrant
of Table 12 may be determined. The same method can be used to determine
the communalities in the lower right quadrant of Table 12. The reason these
procedures have been investigated is the belief that if a communality is
expressed as a function of a large number of fallible coefficients the
determination is more stable than when the determination is made with a small
number of fallible coefficients.
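
The sectioning procedure may be sketched with the entries of Table 14. The grouping of rows (1; 5, 6; 7, 8) and of columns (1; 2; 3, 4) is the one that yields Table 15; the names `block_sum` and `solve2` are assumptions introduced for this illustration.

```python
def solve2(a11, a12, a21, a22, b1, b2):
    """Solve a11*x + a12*y = b1, a21*x + a22*y = b2 by Cramer's rule."""
    det = a11 * a22 - a12 * a21
    return (b1 * a22 - a12 * b2) / det, (a11 * b2 - b1 * a21) / det

# Table 14: rows are variables 1, 5, 6, 7, 8; columns are variables 1, 2, 3, 4
TABLE_14 = [
    [None, .56, .16, .24],   # None is the unknown communality of variable 1
    [.72, .67, .24, .34],
    [.64, .72, .40, .52],
    [.40, .63, .52, .64],
    [.24, .53, .54, .65],
]
ROW_GROUPS = [[0], [1, 2], [3, 4]]
COL_GROUPS = [[0], [1], [2, 3]]

def block_sum(rows, cols):
    return sum(TABLE_14[i][j] for i in rows for j in cols)

# composite matrix (Table 15); the first cell holds only the unknown
comp = [[None if (gi == 0 and gj == 0) else block_sum(rg, cg)
         for gj, cg in enumerate(COL_GROUPS)]
        for gi, rg in enumerate(ROW_GROUPS)]

# row 1 = m1 * row 2 + m2 * row 3, multipliers from columns 2 and 3
m1, m2 = solve2(comp[1][1], comp[2][1],
                comp[1][2], comp[2][2],
                comp[0][1], comp[0][2])
h1_sq = m1 * comp[1][0] + m2 * comp[2][0]
print([[None if x is None else round(x, 2) for x in row] for row in comp])
print(round(h1_sq, 6))  # 0.64
```

Each known cell of the composite is a sum of several coefficients, which is the feature that is expected to give some stability when the coefficients are fallible.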

Table 15

           1         2      3, 4
1       $h_1^2$     .56     .40
5, 6     1.36      1.39    1.50
7, 8      .64      1.16    2.35

7. Expansion of principal minors of order (n-1)

It is possible to write n principal minors of order (n-1) in a square
matrix of order n. If the expansion of each of these n principal minors of
order (n-1) is set equal to zero, the rank of the matrix is assumed to be not
greater than (n-2). This follows from the property of a Gramian matrix
that if all of its principal minors of order m vanish, then the rank of the
matrix does not exceed (m-1). Since there are n principal minors of order
(n-1), their expansions give as many equations as there are unknown
diagonal entries. A unique solution is obtained if the inequality (5) is
satisfied. If this inequality is not satisfied, there should be no unique
solution. In this method it is not necessary to know the rank. These
considerations are of some analytical interest, but they do not seem to lend
themselves to computing purposes.

8. Expansion of principal minors of order (r+a)

This should be a special case of the preceding method but less laborious.
It is not necessary that the rank be known, but it is assumed that (r+a) is
taken larger than the rank. The simplest case is that in which a=1. This
method requires that the number of tests covered by the expanded principal
minors is such as to satisfy inequality (5) even though all the tests in the
correlational matrix are not utilized. The development of this type of
analysis would be of interest, but it does not seem likely to yield practical
computing methods.



CHAPTER III 
THE CENTROID METHOD 
Principles of the method 

The centroid method is a general method of factoring a symmetric matrix 
with real elements.* Its application to the factor problem involves finding 
F when R is known, so as to satisfy the fundamental factor theorem, 
FF'=R. The chief requirements of a method of factoring the correlational 
matrix are that it must be applicable even though the diagonal elements are 
unknown and that it must be applicable even though the intercorrelations 
are subject to sampling errors. These two requirements preclude the use of 
the diagonal method of chapter ii, which is very simple in application when 
the entries are infallible and the diagonals known. 

The purpose of the centroid method in factor analysis is merely to factor
the correlational matrix. Any other method would serve the purpose
equally well provided that the minimum rank of R with unknown diagonals is not
altered. When the correlational matrix has been factored into F and F', the
entries of F cannot be given scientific interpretation until F has been rotated
so that the new reference axes represent primary factors.

Each correlation coefficient in R may be expressed in the form (37-i)

(1)   $r_{jk} = a_{j1}a_{k1} + a_{j2}a_{k2} + \cdots + a_{jr}a_{kr}$ ,

in which there are as many terms in the right member as there are factors
in R. The numerical values of $a_{jm}$ are determined by the arbitrary locations
of the orthogonal reference vectors, since $a_{jm}$ is the projection of the trait
vector j on the reference vector m. The subscript j defines a row of R, and
the subscript k defines a column of R.

The traits are represented by a set of n trait vectors in a space of r dimen- 
sions, and the scalar product of each pair of vectors is the correlation be- 
tween them. It has been shown that this configuration represents the inter- 
correlations and that these are independent of the locations of the orthog- 
onal reference vectors that are implied in (1). Hence the reference vectors 
may be rotated without any effect on the intercorrelations. 

* The first form of the centroid method was described in "Multiple Factor Analysis."
It was improved by the elimination of arbitrary subgroups in "A Simplified Multiple
Factor Method." The method has been further improved as described in this chapter.
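Let the co-ordinate system be rotated so that the centroid of the system
lies in the first axis of reference. The case in which the centroid is at the
origin will be discussed in a later paragraph. Then

(2)   $r_{jk} = a'_{j1}a'_{k1} + a'_{j2}a'_{k2} + \cdots + a'_{jr}a'_{kr}$ .

Summing (2) for all traits j in column k of R, we have

(3)   $\sum_{j=1}^{n} r_{jk} = a'_{k1}\sum_{j=1}^{n} a'_{j1} + a'_{k2}\sum_{j=1}^{n} a'_{j2} + \cdots + a'_{kr}\sum_{j=1}^{n} a'_{jr}$ ,

and summing for all columns k so as to include all entries in R, we have

(4)   $\sum_{k=1}^{n}\sum_{j=1}^{n} r_{jk} = \sum_{k=1}^{n} a'_{k1}\sum_{j=1}^{n} a'_{j1} + \sum_{k=1}^{n} a'_{k2}\sum_{j=1}^{n} a'_{j2} + \cdots + \sum_{k=1}^{n} a'_{kr}\sum_{j=1}^{n} a'_{jr}$ .

But

(5)   $\sum_{k=1}^{n} a'_{km} = \sum_{j=1}^{n} a'_{jm}$ ,

and hence

(6)   $\sum_{k=1}^{n}\sum_{j=1}^{n} r_{jk} = \Bigl[\sum_{j=1}^{n} a'_{j1}\Bigr]^{2} + \Bigl[\sum_{j=1}^{n} a'_{j2}\Bigr]^{2} + \cdots + \Bigl[\sum_{j=1}^{n} a'_{jr}\Bigr]^{2}$ .

The r co-ordinates of the centroid of the system of n points are

      $\dfrac{1}{n}\sum_{j=1}^{n} a'_{j1},\ \dfrac{1}{n}\sum_{j=1}^{n} a'_{j2},\ \ldots,\ \dfrac{1}{n}\sum_{j=1}^{n} a'_{jr}$ .

The co-ordinate axes have been so rotated that the centroid lies in the first
axis of reference. The centroid therefore has zero projections on all the
remaining (r-1) co-ordinate axes. Hence

(7)   $\sum_{j=1}^{n} a'_{j2} = \sum_{j=1}^{n} a'_{j3} = \cdots = \sum_{j=1}^{n} a'_{jr} = 0$ ,

so that the r co-ordinates of the centroid are

      $\dfrac{1}{n}\sum_{j=1}^{n} a'_{j1},\ 0,\ 0,\ \ldots,\ 0$ .

Substituting (7) in (6), we have

(8)   $r_t = \sum_{k=1}^{n}\sum_{j=1}^{n} r_{jk} = \Bigl[\sum_{j=1}^{n} a'_{j1}\Bigr]^{2}$ ,

in which $r_t$ is defined as the sum of all the coefficients in R including the
diagonal terms. The first co-ordinate of the centroid is also its distance from
the origin, since the remaining (r-1) co-ordinates vanish. Hence the
distance of the centroid from the origin is

(9)   $\dfrac{1}{n}\sum_{j=1}^{n} a'_{j1}$ ,

or, by (8),

(10)  $\dfrac{1}{n}\sqrt{\sum_{k=1}^{n}\sum_{j=1}^{n} r_{jk}} = \dfrac{1}{n}\sqrt{r_t}$ .

Substituting (7) in (3), we have

(11)  $\sum_{j=1}^{n} r_{jk} = a'_{k1}\sum_{j=1}^{n} a'_{j1}$ ,

and from (8) it follows that

(12)  $r_k = a'_{k1}\sqrt{r_t}$ ,

where $r_k$ is the sum of all the coefficients in column k of R. If the sum of the
coefficients in column k and the sum of all the coefficients in R are known,
the projection of the vector k on the first axis of reference through the
centroid is also known, namely,

(13)  $a'_{k1} = \dfrac{r_k}{\sqrt{r_t}}$ .

By (13) the first co-ordinate of each trait may be found.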

The numerical value of the first term in the right member of (2) is known
by (13), and hence (2) may be transposed so as to show the first-factor
residuals. Let the first-factor residuals from which the second co-ordinates
are to be found be designated $r_{2.jk}$ for j and k. We have then from (2)

(14)  $r_{2.jk} = r_{jk} - a'_{j1}a'_{k1} = a'_{j2}a'_{k2} + a'_{j3}a'_{k3} + \cdots + a'_{jr}a'_{kr}$ .

Summing for column k,

(15)  $\sum_{j=1}^{n} r_{2.jk} = a'_{k2}\sum_{j=1}^{n} a'_{j2} + a'_{k3}\sum_{j=1}^{n} a'_{j3} + \cdots + a'_{kr}\sum_{j=1}^{n} a'_{jr}$ .

From (7) it follows that

(16)  $\sum_{j=1}^{n} r_{2.jk} = 0$ .

The sum of the residuals is zero in each column.

The number of terms in the right member of (2) is the rank of R. The
number of terms in the right member of (15) is (r-1), and hence this is the
rank of the table of first-factor residuals. The entries in the residual table
may be regarded as the scalar products of all pairs of residual vectors in a
space of (r-1) dimensions. From (7) it is seen that the (r-1) co-ordinates
of the centroid of the residual vectors are zero, and hence the centroid is at
the origin in the (r-1) subspace. This precludes the direct application of
formulae of the type (13) in determining the second and subsequent
co-ordinates of the n points.
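
Equations (13), (14), and (16) can be illustrated with a small table. The loadings in `F_HYP` are hypothetical values chosen for this sketch, and the correlation table is built from them with the communalities in the diagonal cells, so that all coefficients are positive and no reflection is needed; the function names are likewise assumptions.

```python
import math

def first_centroid_loadings(R):
    """Equation (13): a_k1 = r_k / sqrt(r_t), where r_k is the sum of
    column k and r_t is the sum of all coefficients in R."""
    n = len(R)
    col = [sum(R[j][k] for j in range(n)) for k in range(n)]
    r_t = sum(col)
    return [c / math.sqrt(r_t) for c in col], r_t

def first_factor_residuals(R, a):
    """Equation (14): r_2.jk = r_jk - a_j1 * a_k1."""
    n = len(R)
    return [[R[j][k] - a[j] * a[k] for k in range(n)] for j in range(n)]

# hypothetical loadings of four tests on two factors (assumed for illustration)
F_HYP = [[0.8, 0.0], [0.7, 0.4], [0.2, 0.6], [0.3, 0.7]]
R = [[sum(x * y for x, y in zip(fj, fk)) for fk in F_HYP] for fj in F_HYP]

a1, r_t = first_centroid_loadings(R)
res = first_factor_residuals(R, a1)
col_sums = [sum(res[j][k] for j in range(len(R))) for k in range(len(R))]

print([round(a, 4) for a in a1])
print([round(s, 12) for s in col_sums])   # each column of residuals sums to zero
```

The vanishing column sums show why formula (13) cannot be applied to the residual table as it stands: the numerator would be zero for every column.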

In order to make the centroid method applicable in this situation, where
the centroid of the system is at the origin, it is necessary to remove the
centroid from the origin. In order to accomplish this purpose without
destroying the identities of the traits, a new concept will be introduced. Every
point represents a trait. The diametrically opposite point represents the
diametrically opposite trait, which will be called the reflection or image of the
given trait. If a trait +A is represented by the co-ordinates
$a_{11}, a_{12}, \ldots, a_{1r}$, then the co-ordinates of the reflected trait -A
are $-a_{11}, -a_{12}, \ldots, -a_{1r}$. Either the point or its reflection through
the origin may be used to represent the trait as long as the proper sign is
attached to it. In this sense the score on A may be replaced by the same score
with negative sign to represent -A. Both scores represent the same trait except
for sign. If A represents the trait "tactfulness," then -A represents "plus
tactlessness," or "minus tactfulness." The identity of the trait is easily
established with a simple reversal of sign. If some of the traits are reversed
in sign, the centroid of the system will be removed from the origin without
disturbing the identities of the traits. To reverse the signs in a row of F is
to reflect the point through the origin, and it has been shown that this
reversal of sign causes a reversal of sign in the corresponding row and in the
corresponding column of R. The reflection of a trait is accomplished merely by
changing the signs of the correlation coefficients in its row and in its column
of R or in the residual table. The correlation coefficients in $R_0$ have the
same relation to the common-factor space as the residual coefficients have to
the residual subspace.

The next question is to decide which traits to reverse in sign. If all of 
them are reversed, it is clear that the correlational matrix, or the residual 
table, remains unaffected and the centroid remains at the origin. It is neces- 
sary, therefore, to reverse the signs of only some of the traits in the battery. 
It is desirable to account for as much as possible of the residual variance by 
each successive factor, and this should be a guiding consideration in deciding 
upon the traits which are to be reversed. If there is a clustering of traits in 
the (r-1) subspace which is balanced by a scattering of traits on the
opposite side of the centroid, it is desirable to pass the second reference axis
through the cluster. The second axis may be passed anywhere in the resid- 
ual subspace because it is orthogonal to the first axis of reference, which has 
already been located through the centroid of the original system. Since the 
subspaces are frequently of order higher than the third, it is not feasible to 
use any direct graphical methods for finding the clusters. Rather simple 
considerations should make it possible to accomplish the same purpose 
analytically. 

If a trait is in a cluster, its correlations will be high and positive with the 
remaining traits in the cluster; while if it is unique, in the sense that it is 
relatively remote from the other traits, its correlations with the other traits 
will be near zero. If a remote trait is reflected, its correlations will be re- 
versed in sign, so that the majority of them are positive. The principle to be 
applied here is that every trait the majority of whose correlations are nega- 
tive is to be reversed in sign. This will tend to bring all of them into a 
hemisphere, and the centroid will then be removed from the origin without 
destroying the identities of the traits. 

When the sign reversals have been made so that the majority of the cor- 
relations for each trait are positive or zero, the centroid method may again 
be applied as before, by rotating the co-ordinate axes about the first cen- 
troid axis so that the centroid of the residual configuration lies in the second 
axis of reference. This axis will be orthogonal to the first axis of reference 
because the (r-1) subspace is orthogonal to the first axis of reference.

It will be found that the majority of the points in the subspace will have 
projections on the second centroid axis which are positive or zero. If it is 
desired, it is always possible to make a few additional sign changes so as to 
insure that the sum of every column in the residual table is positive or 
zero. This guarantees that the projection of every point in the subspace 
will have a positive projection on the new centroid axis. This additional
adjustment is probably not ordinarily worth the additional computation,
because it will not noticeably affect the location of the new centroid axis. The
principle recommended for practical computations is to reverse the signs of
one trait at a time until the number of negative coefficients in the residual
table is less than n/2. It will be shown that this computation can be easily
routinized.
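
One way of routinizing the reversals is sketched below. It implements the stricter variant mentioned above, in which traits are reflected one at a time until the sum of every column (omitting the diagonal cell) is positive or zero; the choice of the column with the largest negative sum at each step, the function names, and the small residual table built from the hypothetical vector `B` are assumptions of this illustration.

```python
def reflect(R, k):
    """Reverse the sign of trait k: negate row k and column k of R.
    The diagonal cell is negated twice and so keeps its sign."""
    for i in range(len(R)):
        R[k][i] = -R[k][i]
        R[i][k] = -R[i][k]

def reflect_until_positive(R, tol=1e-12):
    """Reflect one trait at a time until no column sum (diagonal omitted)
    is negative. Returns the reflected table and the sign of each trait."""
    R = [row[:] for row in R]
    n = len(R)
    signs = [1] * n
    while True:
        sums = [sum(R[j][k] for j in range(n) if j != k) for k in range(n)]
        k = min(range(n), key=lambda i: sums[i])
        if sums[k] >= -tol:
            return R, signs
        reflect(R, k)          # raises the total of the off-diagonal entries
        signs[k] = -signs[k]

# hypothetical residual table of rank 1 whose columns sum to zero
B = [0.4, -0.3, 0.1, -0.2]
RESIDUAL = [[x * y for y in B] for x in B]

reflected, signs = reflect_until_positive(RESIDUAL)
print(signs)  # [-1, 1, -1, 1]
```

Each reversal of a column with negative sum increases the total of the off-diagonal entries, so the loop must come to an end. In this example the first and third traits are reflected, after which every coefficient in the table is positive.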

After reflection, let (14) be written in the form

(17)  $r'_{2.jk} = a''_{j2}a''_{k2} + a''_{j3}a''_{k3} + \cdots + a''_{jr}a''_{kr}$ .

The value of $a''_{j2}$ is equal to either $+a'_{j2}$ or $-a'_{j2}$, depending on
whether j has been reversed in sign. The correlation $r'_{2.jk}$ is equal to the
residual correlation $r_{2.jk}$ if neither j nor k has been reflected or if both
of them have been reflected. If only one of the traits j and k has been
reflected, then $r'_{2.jk} = -r_{2.jk}$.

After the reflections, let the residual vectors in the residual subspace be
rotated so that the new centroid lies in the second orthogonal reference axis.
Then

(18)  $r'_{2.jk} = a'''_{j2}a'''_{k2} + a'''_{j3}a'''_{k3} + \cdots + a'''_{jr}a'''_{kr}$ .

Summing (18) in a manner similar to that shown in (3) and proceeding as
in (3) to (12), inclusive, we have

(19)  $r'_{2k} = a'''_{k2}\sqrt{r'_{2t}}$ ,

where $r'_{2k}$ is defined as the sum of the first-factor residual coefficients in
column k after reflection, and $r'_{2t}$ is defined as the sum of all the
first-factor residual coefficients after reflection. From (19) the values of
$a'''_{k2}$ may be found. If k has not been changed in sign, this is the second
co-ordinate of k. If k has been changed in sign, then the second co-ordinate
of k is $-a'''_{k2}$.

The procedure for the remaining factors is the same. When the sign re- 
versals have been made, the centroid method is used again by rotating the 
co-ordinate system about the axes that have been established, so that the 
centroid of the residual configuration lies in the next orthogonal reference 
axis. Each successive residual table is reduced in rank by 1. When r factors 
have been extracted, the rth-factor residuals all vanish if the rank of R is r. 

A useful check on the arithmetical work is as follows: Summing (13) for
all tests k,

(20)    $\sum_{k=1}^{n} a_{k1} = \frac{1}{\sqrt{r_t}} \sum_{k=1}^{n} r_k$ .

But by (8), (11), and (12),

(21)    $\sum_{k=1}^{n} r_k = r_t$ ,

and hence

(22)    $\sum_{k=1}^{n} a_{k1} = \sqrt{r_t}$ .

The sum of the factor loadings is equal to the square root of the sum of all 
coefficients in the correlation table. This check is applicable for each of the 
successive factors. 
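
The check can be sketched in a few lines of Python. This is only an illustration: the three-variable table is hypothetical, the function name `first_centroid_loadings` is an assumption, and all entries are taken to be positive so that no sign reversals are needed.

```python
import math

def first_centroid_loadings(table):
    """Loadings a_k1 = r_k / sqrt(r_t) for a symmetric table of coefficients."""
    column_sums = [sum(column) for column in zip(*table)]  # r_k, diagonal included
    total = sum(column_sums)                               # r_t, sum of all entries
    root = math.sqrt(total)
    return [r_k / root for r_k in column_sums], total

# Hypothetical three-variable table with positive entries (illustration only).
table = [
    [1.00, 0.60, 0.50],
    [0.60, 1.00, 0.40],
    [0.50, 0.40, 1.00],
]
loadings, total = first_centroid_loadings(table)

# Check (22): the loadings must add up to the square root of the grand total.
check_22_holds = math.isclose(sum(loadings), math.sqrt(total), rel_tol=1e-12)
print(loadings, check_22_holds)
```

The same comparison applies to each residual table after its reflections have been made.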

Example 1. Unity in the diagonal cells 

A small correlation table will be used in four examples of the centroid
method. The examples differ only in the diagonal entries. In Table 1 are
shown the intercorrelations of three fictitious variables with
self-correlations of unity. Before each entry there are two signs. The first
one is the given sign. The given variables may be designated by number and
sign, as +1, +2, +3. In order to displace the centroid from the origin, the
signs may be reversed so that the sum of the coefficients in each column
(omitting the diagonal entry) shall be positive. In row A these column sums are
recorded. The second column has the largest negative sum. In this example
there is only one column with negative sum. Hence, variable 2 is reversed
in sign. These sign reversals in both column 2 and row 2 are shown by the
second sign before each coefficient.

Table 1

            ++1             +-2             ++3
 ++1    + +1.000000     - + .480000     + + .560000
 +-2    - + .480000     + +1.000000     - + .420000
 ++3    + + .560000     - + .420000     + +1.000000

  A      + .080000       - .900000       + .140000
  B      +1.040000       + .900000       + .980000
  C      +1.000000       +1.000000       +1.000000
  D      +2.040000       +1.900000       +1.980000      5.920000
  E      + .838435       + .780895       + .813775      2.433105
  K      + .838435       - .780895       + .813775      .4109975

The new sums (omitting diagonals) are shown in row B. All of the sums
are now positive. If negative sums remained, further sign reversals would
be made, as shown in subsequent examples. In row C is shown the diagonal
entry for each column and in row D is shown the sum of all coefficients in
each column.

The last entry in row D is the sum of all coefficients in the table. It is $r_t$
in (8). The square root of this sum is also recorded; and immediately below
this is recorded its reciprocal, as required in (13). In row E is shown the
first-factor loading for each of the three variables with signs used in Table 1.
In row K are shown the first-factor loadings with signs which correspond to
the original positive signs of the variables.
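
Rows A to K of Table 1 can be retraced with a short Python sketch. The function names are assumptions introduced for illustration, and the small tolerance in the stopping test is an added guard against rounding noise rather than part of the procedure described above.

```python
import math

def reflect_by_column_sums(table):
    """Reverse one variable at a time until no off-diagonal column sum is negative."""
    n = len(table)
    signs = [1] * n
    while True:
        # Rows A and B: column sums with the diagonal entry omitted.
        sums = [sum(signs[j] * signs[k] * table[j][k] for j in range(n) if j != k)
                for k in range(n)]
        worst = min(range(n), key=lambda k: sums[k])
        if sums[worst] >= -1e-12:        # tolerance for rounding noise (assumption)
            return signs
        signs[worst] = -signs[worst]     # reflect the column with the largest negative sum

def centroid_factor(table):
    """Return (row E, row K, signs) for one centroid factor."""
    n = len(table)
    signs = reflect_by_column_sums(table)
    # Row D: column sums after reflection, diagonal included.
    row_d = [sum(signs[j] * signs[k] * table[j][k] for j in range(n)) for k in range(n)]
    multiplier = 1.0 / math.sqrt(sum(row_d))
    row_e = [d * multiplier for d in row_d]          # loadings of the reflected variables
    row_k = [s * e for s, e in zip(signs, row_e)]    # loadings of the original variables
    return row_e, row_k, signs

table_1 = [
    [1.00, -0.48, 0.56],
    [-0.48, 1.00, -0.42],
    [0.56, -0.42, 1.00],
]
row_e, row_k, signs = centroid_factor(table_1)
print(signs, row_e, row_k)
```

With the entries of Table 1 the sketch reverses variable 2 only, which agrees with the second sign recorded before each coefficient.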

In order to extract the second factor, Table 2 is prepared with the residuals
(14). In computing these residuals, the given variables are taken with
the following signs: +1, -2, +3. These are the signs used in computing
row E in Table 1. The factor loadings in row E then correspond to $a_{k1}$ in
(14).

Table 2

            ++1             -+2             ++3
 ++1    + + .297027     - + .174730     - - .122297
 -+2    - + .174730     + + .390203     - + .215473
 ++3    - - .122297     - + .215473     + + .337770

  Σ₀       .000000         .000000         .000000
  A      - .297027       - .390203       - .337770
  B      + .052433       + .390203       + .093176
  C      + .297027       + .390203       + .337770
  D      + .349460       + .780406       + .430946      1.560812
  E      + .279719       + .624662       + .344943      1.249325
  K      + .279719       + .624662       + .344943       .800432



In row Σ₀ of Table 2 are shown the sums of the columns. These all vanish,
as proved by (16). Row A shows the sum of the coefficients in each column 
(omitting the diagonal entry). The second column has the largest negative 
sum. Hence, variable 2 is reversed in sign. 

After reversing the signs of the residuals for variable 2, the new sums are 
recorded in row B (omitting diagonals). The diagonal entry for each column 
is recorded in row C. The sum of all coefficients in each column is shown in 
row D. The same procedure as before gives the second-factor loadings 
shown in row E. Row K is the same as row E because all three of the vari- 
ables happen to be positive. 

Table 3 was prepared in order to extract the third-factor loadings. In it
are recorded first the residuals from Table 2. The first sign is positive for
each of the three variables because that is the sign arrangement which
resulted after the sign reversal for the second factor. In row Σ₀ is recorded
the sum for each column, which is zero. This is a check on the arithmetical
work. Row A shows the sum of each column, omitting the diagonal entry. Since
both the first and the third columns have the same negative sum, it is
immaterial which of them is reversed in sign for the extraction of the
third-factor loadings. The first variable was here reversed in sign. In row B
are recorded the new sums, omitting the diagonal entries. In row C are
recorded the diagonal entries. In practice the repetition of row C is not
needed, since the diagonal entries are available in the correlation table.
Row D is the sum of rows B and C. The sum of all entries in row D is recorded
at the right. It is the sum of all the coefficients in the residual table.
Next below it is its square root; and next below that is recorded the
reciprocal, as before. The multiplier is then applied to row D; and the result
is row E, which contains the third-factor loadings with the signs of the
variables after reversals for the third factor. In row K are recorded the
third-factor loadings which correspond to the three variables taken with
positive sign.

Table 3

            +-1             ++2             ++3
 +-1    + + .218784     + - .000000     - + .218784
 ++2    + - .000000     + + .000000     + + .000000
 ++3    - + .218784     + + .000000     + + .218784

  Σ₀       .000000         .000000         .000000
  A      - .218784         .000000       - .218784
  B      + .218784         .000000       + .218784
  C      + .218784         .000000       + .218784
  D      + .437568         .000000       + .437568       .875136
  E      + .467744         .000000       + .467744       .935487
  K      - .467744         .000000       + .467744      1.068962



If the same process is repeated in the attempt to extract a fourth factor 
from the residuals of Table 3, it will be found that all of the residuals vanish 
exactly. Hence the intercorrelations have been described in terms of as 
many factors as there are variables. The rank of the given Table 1 is 3, and 
this is also the number of factors which will exactly account for the inter- 
correlations. 
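
The whole of Example 1 can be retraced with the following Python sketch. It is a minimal illustration under stated assumptions: the function name `centroid_factors` is hypothetical, the residuals are kept in the orientation of the original (unreflected) variables rather than carried from table to table with reflected signs, and a small tolerance guards the stopping test against rounding noise. When two columns tie, as in the third factor here, either may be reversed, so the sign pattern of the last factor may be the mirror image of the one recorded in the text.

```python
import math

def centroid_factors(table, count):
    """Extract `count` centroid factors; return (columns of loadings, last residuals)."""
    n = len(table)
    residual = [row[:] for row in table]
    columns = []
    for _ in range(count):
        signs = [1] * n
        while True:
            # Off-diagonal column sums under the current reflections.
            sums = [sum(signs[j] * signs[k] * residual[j][k]
                        for j in range(n) if j != k) for k in range(n)]
            worst = min(range(n), key=lambda k: sums[k])
            if sums[worst] >= -1e-12:    # tolerance for rounding noise (assumption)
                break
            signs[worst] = -signs[worst]
        # Row D (diagonal included), then loadings of the unreflected variables.
        row_d = [sum(signs[j] * signs[k] * residual[j][k] for j in range(n))
                 for k in range(n)]
        root = math.sqrt(sum(row_d))
        loading = [signs[k] * row_d[k] / root for k in range(n)]
        columns.append(loading)
        residual = [[residual[j][k] - loading[j] * loading[k] for k in range(n)]
                    for j in range(n)]
    return columns, residual

table_1 = [
    [1.00, -0.48, 0.56],
    [-0.48, 1.00, -0.42],
    [0.56, -0.42, 1.00],
]
columns, residual = centroid_factors(table_1, 3)
largest_residual = max(abs(x) for row in residual for x in row)
print(columns, largest_residual)
```

After three factors the largest remaining residual is zero to within rounding, which is the numerical counterpart of the statement that the rank of Table 1 is 3.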

The three factor loadings for each of the three variables are summarized
in the upper half of Table 4. These factors reproduce the given
intercorrelations in Table 1. In the lower half of Table 4 are recorded the
factor loadings with the signs which correspond to those with which the
factoring of
Table 1 was made. In that table the second variable was reversed in sign. 
When the factor loadings are taken with signs to correspond to the signs of 
the variables with which the factoring is made, the sum of each column 
after the first vanishes. This is shown at the bottom of Table 4.

Table 4

              I               II              III
  +1      + .838435       + .279719       - .467744
  +2      - .780895       + .624662         .000000
  +3      + .813775       + .344943       + .467744

              I               II              III
  +1      + .838435       + .279719       - .467744
  -2      + .780895       - .624662         .000000
  +3      + .813775       + .344943       + .467744

  Σ       +2.433105         .000000         .000000

Example 2. Communalities in the diagonal cells

In the first example unity was recorded in each diagonal cell. As a
consequence, the rank of the correlational matrix became equal to its order,
namely, 3. In the present example the communality is recorded in each
diagonal, with the result that the rank of the matrix is reduced to 1. The
procedure of extracting the factor loadings is here exactly the same as in the
previous example.

Table 5 shows the given intercorrelations, as well as the diagonal
communality entries which are assumed to be known in this example. The
calculations are summarized in the several rows below the correlational matrix.
Row A shows the sum of the coefficients in each column, omitting the diagonal
entry. The largest negative sum is for the second column, and hence
the second variable is reversed in sign. The resulting sums, omitting
diagonals, are recorded in row B. The diagonal entries are repeated in row C.
Row D is the sum of rows B and C. Hence row D shows the sum of all the
coefficients in each column. The total at the right of this row is the sum of
all the coefficients in the table. Immediately below it is its square root, and
below that is the reciprocal. This is the multiplier by which row E is
obtained from row D. Row E shows the factor loading for each variable with
the sign that was used for the factoring. Reversing the sign of the second
variable, we have the factor loadings in row K, which represent the loadings
when all of the variables are taken with positive sign.

Table 5

            ++1             +-2             ++3
 ++1    + + .640000     - + .480000     + + .560000
 +-2    - + .480000     + + .360000     - + .420000
 ++3    + + .560000     - + .420000     + + .490000

  A      + .080000       - .900000       + .140000
  B      +1.040000       + .900000       + .980000
  C      + .640000       + .360000       + .490000
  D      +1.680000       +1.260000       +1.470000      4.410000
  E      + .800000       + .600000       + .700000      2.100000
  K      + .800000       - .600000       + .700000      .4761905

If an attempt is made to obtain second-factor loadings for these three 
variables, it will be found that all of the residuals vanish. Hence one factor 
is sufficient to describe all of the intercorrelations in this table. The rank of 
the given correlation table is therefore 1, although its order is 3. 
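
The reduction in rank can be verified directly. The following Python sketch takes the reflection of variable 2 from the text as given instead of searching for it, and the variable names are assumptions made for illustration.

```python
import math

# Table 5: the correlations of Table 1 with the communalities in the diagonal.
table_5 = [
    [0.64, -0.48, 0.56],
    [-0.48, 0.36, -0.42],
    [0.56, -0.42, 0.49],
]
signs = [1, -1, 1]   # variable 2 is reflected, as stated in the text
n = len(table_5)

# Row D: column sums after reflection, diagonal included.
row_d = [sum(signs[j] * signs[k] * table_5[j][k] for j in range(n)) for k in range(n)]
multiplier = 1.0 / math.sqrt(sum(row_d))

# Row K: loadings for the variables taken with positive sign.
row_k = [signs[k] * row_d[k] * multiplier for k in range(n)]

# First-factor residuals; they vanish when one factor is sufficient.
residuals = [[table_5[j][k] - row_k[j] * row_k[k] for k in range(n)] for j in range(n)]
largest = max(abs(x) for row in residuals for x in row)
print(row_k, largest)
```

Since every residual is zero to within rounding, no second factor can be extracted from this table.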

Example 3. Each diagonal entry greater than the communality and less 
than unity 

In the first example it was shown that when unity is recorded in the
diagonals of the correlational matrix, the intercorrelations can be described
exactly in terms of as many factors as there are variables. In the second
example it was shown that when the communalities are recorded in the diagonal
cells, the rank of the matrix is reduced, so that a single factor is
sufficient for the particular example here used. In the third example an
arbitrary diagonal entry is recorded which is greater than the communality but
less than unity. The resulting correlational matrix can be described in
terms of as many factors as there are tests or variables.

Table 6

             +1             +2             +3
  +1     + .700000      - .480000      + .560000
  +2     - .480000      + .500000      - .420000
  +3     + .560000      - .420000      + .600000

Table 6 is such a matrix in which the diagonal entries exceed the commu- 
nalities by arbitrary increments. The extraction of the factors is effected 
in exactly the same manner as has been described in the first two examples. 
The result is summarized in Table 7, which shows the three factor loadings 
for each of the three variables. The third-factor residuals vanish exactly. 

Table 7

              I               II              III
  +1      + .800900       + .124013       - .207798
  +2      - .644402       + .291112         .000000
  +3      + .727254       + .167098       + .207798

Example 4. Each diagonal entry less than the communality

It is of some interest to know that the centroid method of factoring a
symmetric matrix is applicable not only to those matrices whose factors are
real but also to those symmetric matrices whose factors are imaginary.
When the diagonal entries are made less than the communalities, the Gramian
properties of the correlational matrix are destroyed and the factors are
then imaginary. The fourth example illustrates this case. Table 8 contains
the same intercorrelations as those of Table 1, but the diagonal entries have
been reduced below the communalities by arbitrary decrements. Application
of the centroid method in exactly the same manner as for the previous
examples gives the factor loadings shown in Table 9. The second and third
columns of the factorial matrix of Table 9 are imaginary. The co-ordinates
of this table reproduce the intercorrelations exactly and the third-factor
residuals all vanish.

Table 8

             +1             +2             +3
  +1     + .600000      - .480000      + .560000
  +2     - .480000      + .250000      - .420000
  +3     + .560000      - .420000      + .400000

Table 9

              I               II               III
  +1      + .803111       + .106981i       - .183145i
  +2      - .563157       + .259126i         .000000
  +3      + .675789       + .152146i       + .183148i
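
The imaginary loadings of Table 9 can be checked numerically. The Python sketch below rests on an assumption that is not spelled out in the text: after the first (real) factor has been removed, the residual table is factored with its signs reversed, and each loading so obtained is multiplied by i. The function names are hypothetical, and the residuals are kept in the orientation of the unreflected variables.

```python
import math

def centroid_factor(table):
    """One centroid factor, returned as loadings of the unreflected variables."""
    n = len(table)
    signs = [1] * n
    while True:
        sums = [sum(signs[j] * signs[k] * table[j][k] for j in range(n) if j != k)
                for k in range(n)]
        worst = min(range(n), key=lambda k: sums[k])
        if sums[worst] >= -1e-12:        # tolerance for rounding noise (assumption)
            break
        signs[worst] = -signs[worst]
    row_d = [sum(signs[j] * signs[k] * table[j][k] for j in range(n)) for k in range(n)]
    root = math.sqrt(sum(row_d))
    return [signs[k] * row_d[k] / root for k in range(n)]

def remove_factor(table, loading):
    """Residual table after the cross products of one factor are subtracted."""
    n = len(table)
    return [[table[j][k] - loading[j] * loading[k] for k in range(n)] for j in range(n)]

table_8 = [
    [0.60, -0.48, 0.56],
    [-0.48, 0.25, -0.42],
    [0.56, -0.42, 0.40],
]
first = centroid_factor(table_8)
# Assumption: factor the sign-reversed residuals and attach i to the result.
negated = [[-x for x in row] for row in remove_factor(table_8, first)]
second = centroid_factor(negated)
third = centroid_factor(remove_factor(negated, second))

factors = [[first[k], 1j * second[k], 1j * third[k]] for k in range(3)]
reproduced = [[sum(factors[j][m] * factors[k][m] for m in range(3)) for k in range(3)]
              for j in range(3)]
largest_error = max(abs(reproduced[j][k] - table_8[j][k])
                    for j in range(3) for k in range(3))
print(factors, largest_error)
```

The cross products of the complex loadings return the entries of Table 8 to within rounding, because the square of each imaginary loading enters with negative sign.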



Example 5. A fictitious eight-variable problem with known communalities

The first four examples are intended to show the factoring of a symmetric 
matrix with four different conditions as regards the diagonal entries. The 
fifth example is intended to illustrate a variant procedure in selecting the 
variables which are to be reversed in sign. The sign-reversing method which 
is described here for the fifth example is recommended for most practical 
problems, since it is simpler in computation than the more complete method 
of the first four examples and since the simpler method gives results that are 
almost identical with those of the more elaborate sign-changing method.

Table 10 shows the intercorrelations of eight variables whose
self-correlations are known. All of the intercorrelations are here taken to be
positive.
At the bottom of each column is recorded the sum of the coefficients in the 
column, including the diagonal entry. The reason for taking the sum of all 
the entries in this example is that no sign reversals are necessary for the 
first factor when all of the given intercorrelations are positive. In the lower 
right corner of the table are shown the entries which are required to deter- 
mine the multiplying factor. In the last row of the table are the first-factor 
loadings. 

Table 10

           1         2         3         4         5         6         7         8
  1       .64       .56       .16       .24       .72       .64       .40       .24
  2       .56       .65       .38       .49       .67       .72       .63       .53
  3       .16       .38       .40       .48       .24       .40       .52       .54
  4       .24       .49       .48       .58       .34       .52       .64       .65
  5       .72       .67       .24       .34       .82       .76       .52       .35
  6       .64       .72       .40       .52       .76       .80       .68       .56
  7       .40       .63       .52       .64       .52       .68       .74       .71
  8       .24       .53       .54       .65       .35       .56       .71       .73

  D      3.60      4.63      3.12      3.94      4.42      5.08      4.84      4.31
  K    .617940   .794740   .535548   .676301   .758693   .871983   .830787   .739812

       33.94     = ΣD
   5.8258047     = √ΣD
   .17165011     = 1/√ΣD



The first-factor residuals are shown in Table 11. In front of each residual 
there are one, two, or three signs. Each sign is recorded in the first, the sec- 
ond, or the third position. The first sign is the sign of the residual as ob- 
tained from the given coefficients and the factor loadings of Table 10. In 
row Σ₀ is shown the sum for all the coefficients in each column. These sums
vanish, as shown in (16). 

In order to select the variables which are to be reversed in sign so as to 
move the centroid of the system as far as possible from the origin, Table 12 
was prepared. It will be referred to as the sign table. This table illustrates 
a variant method of sign changing. At the top of the table are listed the 
variables from 1 to 8. In the first row is shown the number of negative en- 
tries in each column of Table 11. It so happens that in this example the num- 
ber of negative entries is four for each column. Ordinarily these sums are 
not all the same. 

The usual procedure is to select for sign reversal that variable whose col- 
umn has the largest number of negative entries. Since this is four for each 
column, it is immaterial which of the variables is chosen for the first sign 
reversal. The first variable is so chosen. It is recorded at the right of the 
second row in Table 12. The fact that the first variable is to be reversed in 
sign is also indicated by the cross at the top of column 1. 

There are eight entries in each column of Table 11; but since the diagonal
entry is always positive, there are only seven entries in each column that
are subject to sign reversal. In the first column of Table 11 there are four 
negative and three positive items, ignoring the diagonal entry. Hence, 
when the first variable is changed in sign, there will be three negative signs 
in the first column. This is the first entry in the second row of Table 12. 

Each of the succeeding entries in the second row of Table 12 is determined 
in the following manner. If the sign in the first row of Table 11 is positive, 
then the entry in the first row of Table 12 is augmented by 1. If the sign in 
the first row of Table 11 is negative, then the entry in the first row of Table 
12 is reduced by 1. In this manner the remaining entries in the second row 
of Table 12 are determined. 

The procedure is summarized with the following notation: 

    $r_{ij}$ = given correlation or residual;

    $N_j$ = number of negative signs in the jth column of the given table of
          correlations or residuals;

    n = number of variables;

    $A_{ij}$ = the entry in the ith row and jth column in the sign table;

    $B_{ij}$ = +1 or -1. The sign of $B_{ij}$ agrees with the sign of $r_{ij}$;

    $C_j$ = +1 or -1. The sign is taken negative if variable j has been
          reversed in sign an odd number of times. Otherwise it is taken
          positive;

    $k_i$ = variable which is reversed in sign in row i;

    $A_{ik} \geq A_{ij}$ where $j \neq k$; $A_{ik}$ is the largest value of $A_{ij}$ in row i.

The successive steps in reflecting the variables are as follows:

1) The first row of the sign table contains $N_j$ in column j;

2) Select the highest value of $N_j$. Let it be column k. The variable k is
to be reversed in sign;

3) Record k at the end of row 2;

4) Make a cross or check mark at the top of column k;

5) Record $A_{ij}$ in the next row where

(23)    $A_{ij} = n - 1 - N_j$  when $j = k$ ,

and

(24)    $A_{ij} = A_{(i-1)j} + B_{kj} C_j$  when $j \neq k$ ;

If a correlational entry is zero, count it as positive.

6) Find $A_{ik}$, the highest value in row i. Check the top of its column and
record the number of the column at the end of row (i+1);

7) Record $A_{ij}$, as in step 5, for each row of the sign table until all entries
$A_{ij} < n/2$.

The columns which are checked are to be reflected in the table of
correlations or residuals.

A check on the arithmetical work of each row i is as follows:

(25)    $\sum_{j=1}^{n} A_{ij} = \sum_{j=1}^{n} A_{(i-1)j} - 2[A_{(i-1)k} - A_{ik}]$ .
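
The sign table for the fifth example can be sketched in Python as follows. This is an illustration under stated assumptions: the function names are hypothetical, the first-factor residuals are recomputed from Table 10 rather than read from Table 11, and the number of negative signs in each column is recounted after every reversal instead of being updated by (23) and (24). The two routes should give the same rows; the recount is used only because it is shorter to write.

```python
import math

TABLE_10 = [
    [.64, .56, .16, .24, .72, .64, .40, .24],
    [.56, .65, .38, .49, .67, .72, .63, .53],
    [.16, .38, .40, .48, .24, .40, .52, .54],
    [.24, .49, .48, .58, .34, .52, .64, .65],
    [.72, .67, .24, .34, .82, .76, .52, .35],
    [.64, .72, .40, .52, .76, .80, .68, .56],
    [.40, .63, .52, .64, .52, .68, .74, .71],
    [.24, .53, .54, .65, .35, .56, .71, .73],
]

def negative_counts(table, signs):
    """Negative entries per column, diagonal ignored, zero counted as positive."""
    n = len(table)
    return [sum(1 for j in range(n)
                if j != k and signs[j] * signs[k] * table[j][k] < 0)
            for k in range(n)]

def sign_table(table):
    """Return (reflected variables, rows of the sign table, final signs)."""
    n = len(table)
    signs = [1] * n
    rows = [negative_counts(table, signs)]
    reflected = []
    while max(rows[-1]) >= n / 2:
        k = rows[-1].index(max(rows[-1]))   # first column with the highest count
        signs[k] = -signs[k]
        reflected.append(k + 1)             # variables are numbered from 1
        rows.append(negative_counts(table, signs))
    return reflected, rows, signs

n = len(TABLE_10)
column_sums = [sum(column) for column in zip(*TABLE_10)]
root = math.sqrt(sum(column_sums))
first = [s / root for s in column_sums]
residuals = [[TABLE_10[j][k] - first[j] * first[k] for k in range(n)] for j in range(n)]

reflected, rows, signs = sign_table(residuals)

# Second-factor loadings (row K) from the reflected residuals.
row_d = [sum(signs[j] * signs[k] * residuals[j][k] for j in range(n)) for k in range(n)]
multiplier = 1.0 / math.sqrt(sum(row_d))
second = [signs[k] * row_d[k] * multiplier for k in range(n)]

# Check (25) on every row of the sign table.
check_25 = all(
    sum(rows[i]) == sum(rows[i - 1]) - 2 * (rows[i - 1][k - 1] - rows[i][k - 1])
    for i, k in enumerate(reflected, start=1)
)
print(reflected, rows, check_25)
```

Ties are resolved by taking the first column with the highest count, which is the choice made in the text when every column shows four negative entries.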

The sign table shows that variables 1, 2, 5, and 6 are to be reflected. The 
signs are reversed in the second position for these four rows in Table 11. Then 
the signs are reversed in the third position for the four columns. After
making these sign reversals as shown in Table 11, each residual is to be taken
with the sign that is next in front of it, irrespective of its position.

The rows and columns are designated by numbers. The sign reversals are 
also recorded in front of these numbers so as to show at a glance which of 
the variables have been reflected. 

In row D is recorded the sum of each column after reflection. At the low- 
er right corner of the table are shown the entries for the multiplier. In 
row E are shown the resulting second-factor loadings with signs to corre- 
spond to the reflected variables. In row K are shown the factor loadings for 
the original unreflected variables. 

A check on the arithmetical work is that the sum of row E must equal
$\sqrt{\Sigma D}$. This is the check described by (22).

A repetition of the same procedure for the second-factor residuals shows 
that they all vanish. Therefore the given coefficients in Table 10 can be ac- 
counted for exactly by two factors. 

Table 13 gives a summary of the factor loadings for the eight variables. 
Two factor loadings are shown for each variable. The cross products in this 
table reproduce the correlations of the original unreflected variables. 

The two methods of sign changing that have been described may be compared
as follows: In the first and more complete method, that trait is reflected
which has the largest absolute negative sum of coefficients in its column.
After reflecting this trait, the sums are again determined, and the
trait with the largest negative sum of coefficients is reflected. This
procedure is continued until all of the sums of columns are positive, when the
diagonal entries are ignored. In the second and shorter method, that trait
is reversed in sign which has the largest number of negative coefficients in
its column. After the reflection, the trait which has the largest number of
negative coefficients is reflected. This process is continued until no trait
remains for which the number of negative coefficients exceeds (n - 1)/2. The
two methods may be combined by using the shorter method first. When the
number of negative signs has been minimized as described, there may still
remain one or more small negative column sums, omitting the diagonal
entries. The first method can then be used until all of the column sums are
positive. This is the procedure which is illustrated in example 6, but in
practice it is probably not worth the additional labor to make any
refinements beyond the shorter procedure. The first method can be arranged
with a check column in a manner similar to that of (25).

Table 13

Factor Loadings in Fictitious
Eight-Variable Example

              I               II
  1       + .617940       - .508084
  2       + .794740       - .135603
  3       + .535548       + .336434
  4       + .676301       + .350166
  5       + .758693       - .494353
  6       + .871983       - .199114
  7       + .830787       + .223146
  8       + .739812       + .427408

  Σ       +5.825804         .000000



Example 6. The centroid method with unknown diagonals 

In the previous examples it has been assumed that the diagonal entries 
were known. The sixth example illustrates the application of the centroid 
method to an actual set of data. Since the communalities are unknown, the 
diagonal entries are also unknown. The diagonal entry will be estimated by 
method No. 4 in chapter ii. Fortunately, the diagonal entry may be given 
any value between zero and unity without affecting the results markedly, 
especially when the number of variables is as large as twenty or thirty or 
more. Hence even rough estimates of the diagonal entries are sufficient for 
reasonably accurate factor loadings by the centroid method. 

Table 14 contains the intercorrelations of fifteen psychological tests that
were used by Professor Brigham in a recent experimental study. The tests
will be identified by the same numbers that were used by Brigham.* In each
diagonal cell is recorded the highest correlation in the column. In row D is
shown the sum of each column. In the lower right corner are recorded the
sums required for the multiplier, and in row E are recorded the resulting
first-factor loadings. These sums are checked by (22). Since all of the tests
are positively intercorrelated, it is not necessary to reverse any of the signs
in this table. The row K shows the factor loadings with the original signs of
the tests. Since no sign changes are necessary, the last two rows are
identical.

*Carl G. Brigham, A Study of Error (New York: College Entrance Examination
Board, 1932), p. 275.

Table 15 shows the first-factor residuals. The diagonal entries are
recorded first as residuals from the previous table. The sum of each column of
residuals, including the diagonal, is recorded in row Σ₀. Each of these sums
should be zero. Since the residuals are recorded to three decimals, the sum
of the residuals in each column will be zero except for the discrepancies
which are caused by rounding off the third decimal of each residual. The
fact that these sums vanish within a small discrepancy in the last decimal
place proves the arithmetical work.

Before the second-factor loadings can be extracted, some of the variables
must be reflected. In order to ascertain which variables to reflect, Table 16
is prepared. A cross (X) at the top of a column indicates that the test of
that column is to be reflected. The procedure in preparing this table is
similar to that of the tables of sign changing which have already been
described.

Table 16

                                     X   X   X   X   X   X   X   X   X
         10   2   5   3   4   1   8   7   9   6  15  14  17  11  18   Check
  N_j     9   9   9   9   7   7   8   7   9   8  10   8   7   8   9    124
  15      8   8   8   8   6   6   7   8   8   7   4   9   8   9   8    112
  14      7   7   7   7   5   5   6  ..  ..  ..  ..  ..  ..  10  ..    104
  11      6  ..  ..  ..  ..  ..   7  ..  ..  ..  ..   4  10   4  10     92
  18      5   5   5   5  ..   3   8   7   7  ..   3   3  11  ..   4     80
  17      4   4   4   4   4   2   7   8   8   7   2   2   3   2   3     64
   7      3   3   3   3   3   3   8   6   9   8   1   3   2   1   4     60
   9      2   2   2   2   2   2   9   5   5   9   2   2   1   2   5     52
   6      1   1   1   1   3   1  10   4   4   5   3   1   0   3   6     44
   8      0   0   0   0   2   2   4   3   3   4   4   2   1   2   5     32

The sign changes are indicated in Table 15. A sign in the first position is 
the sign of the residual. The change of sign in each row is indicated in the 
second position. The change of sign in each column is shown in the third 
position. The sign next in front of the residual is the sign which is used in 
summing each column for the second-factor loading.

Each diagonal residual is erased. In each diagonal is recorded, instead,
the largest residual of the column, irrespective of its sign. The diagonal 
entry is always recorded with positive sign. 
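
The replacement of the diagonal entries can be sketched in Python. The function name `estimate_diagonals` and the small residual table are hypothetical; the numbers are chosen only for illustration and are not taken from Brigham's data.

```python
def estimate_diagonals(table):
    """Copy of `table` in which each diagonal cell holds the largest absolute
    off-diagonal entry of its column, recorded with positive sign."""
    n = len(table)
    estimated = [row[:] for row in table]
    for k in range(n):
        estimated[k][k] = max(abs(table[j][k]) for j in range(n) if j != k)
    return estimated

# Hypothetical residual table (illustration only).
residuals = [
    [0.031, -0.120, 0.045],
    [-0.120, 0.027, 0.082],
    [0.045, 0.082, 0.019],
]
estimated = estimate_diagonals(residuals)
new_diagonal = [estimated[k][k] for k in range(len(estimated))]
print(new_diagonal)
```

The off-diagonal entries are left untouched, so the sign changes already recorded for the rows and columns are not affected by this step.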

The sum of each column is shown in row D. In row E is recorded the sec- 
ond-factor loading for each test after reflection. In row K is recorded the 
second-factor loading for each test taken with positive sign. 

Table 17 shows the second-factor residuals, and Table 18 is the corre- 
sponding table of sign changes. The procedure is the same as for the pre- 
ceding tables. The sum of each column, including the diagonal residual en- 
try, is shown in row So. The fact that all of these sums vanish within a small 
discrepancy in the last decimal proves the arithmetical work. The diagonal 
residual entries are then erased, and the absolute maximum of each column 
is recorded in the diagonal cell. The sign changes indicated in Table 18 are 
then made. The sum of each column, without the diagonal, is shown in the 
first row of Table 19. There are several negative entries in this row, namely, 
for columns 2 and 9. Variable 9 is changed in sign. The new sums, omitting 
diagonals, are recorded in the second row of Table 19. The second entry is 
still negative. The second variable is changed in sign, and the new sums are 
recorded in the third row. A negative sign appears in column 4. The fourth 
variable is then changed, and the sums are recorded in the last row. All 
sums are now positive. The entries in the last row of Table 19 are added 
to the diagonal entries in Table 17. The sums are recorded in row D of Table 
17. The factor loadings are recorded in row E. In row K are found the 
third-factor loadings for the original unreflected tests. 

Tables 20 and 21 are prepared in a similar way for determining the fourth- 
factor loadings. 

Tables 22, 23, and 24 are prepared for the fourth-factor residuals and the 
fifth-factor loadings. The residuals in Table 22 are so small that they can be 
ignored. The standard deviation of discrepancies is .024. 

The five factor loadings for each of Brigham's fifteen tests are summarized 
in Table 25. The contributions of the fifth factor to the correlations can be 
ignored. Each of the given intercorrelations can be reproduced from the 
first four factor loadings of this table within the discrepancies which are re- 
corded in Table 22. It is an error, frequently made, to attempt a psychologi- 
cal interpretation of the factors in Table 25. It is not unlikely that each col- 
umn of this table has psychological meaning, but there is no guaranty that 
such interpretation will be useful or fundamentally significant. The table 
represents merely the arbitrary centroid co-ordinates of a set of fifteen 
points in a space of five dimensions. The orthogonal reference axes which 
are obtained by the centroid method and which are represented by the five 
columns of Table 25 must be rotated into a new set of orthogonal or oblique
reference axes before psychological interpretation can be made with
confidence. It has been shown that there exists an infinite set of orthogonal
reference axes in terms of which the fifteen test vectors may be described as
well
as by those which are obtained by the centroid method. One of the principal 
problems in factor theory is to find a computationally feasible criterion by 
which this rotation can be effected uniquely and by which the reference 
axes so obtained have fundamental psychological meaning. The solution of 
this problem is described in several of the subsequent chapters. These solu- 
tions all begin with a given factorial matrix like that of Table 25. All of the 
solutions will be presented with the same set of illustrative data wherever 
feasible. 

Correction for uniqueness 

It has been shown that the factorial matrix represents the co-ordinates of 
the termini of n trait vectors in a common-factor space of r dimensions. 
Table 25 represents therefore the co-ordinates of fifteen points in five dimen- 
sions. The square of the length of each trait vector represents its commu- 
nality. If the traits could be freed from the variable errors and from the 
specific factor, then the intercorrelations would be augmented in a manner 
analogous to the correction for attenuation. In correcting a coefficient for 
attenuation, the variable errors are removed. When a correlation between 
two traits is corrected not only for the variable errors but also for the specific 
factor in each test, the augmented correlation coefficient will be said to be 
"corrected for uniqueness." Hence the coefficients which are corrected for 
uniqueness are higher than those which are corrected only for attenuation. 

The geometrical interpretation of the correction for uniqueness is of some 
interest. It has been shown that the correlation between two traits is the 
scalar product of the two trait vectors in the common-factor space. If each 
of the vectors is extended to meet the unit sphere so that each vector be- 
comes a unit vector, then the scalar product of two such vectors is the cosine 
of their angular separation. The traits can then be represented as points on 
the surface of a hypersphere, and the angular separation between pairs of 
points represents the correlation after correction for uniqueness. For some 
problems in which the common-factor space can be reduced to three dimen- 
sions certain graphical methods are available in which each trait is repre- 
sented as a point on the surface of a sphere. The plotting of the trait vectors 
on the surface of a sphere is facilitated by correcting the coefficients for 
uniqueness, because the augmented coefficient represents the cosine of the 
angular separation of a pair of vectors. The correction of the intercorrela- 
tions for uniqueness also facilitates the isolation of clusters of tests. These 
applications will be described in subsequent chapters. 

The correlation coefficient can be regarded as the scalar product of a pair 
of test vectors. The lengths of the vectors are the square roots of their 
communalities. Hence 

    $r_{jk} = h_j h_k \cos \phi_{jk}$ , 

so that 

    $\cos \phi_{jk} = \dfrac{r_{jk}}{h_j h_k} = R_{jk}$ , 

in which $R_{jk}$ is the correlation coefficient, corrected for uniqueness. 

Table 26 shows the intercorrelations of Brigham's fifteen tests after cor- 
rection for uniqueness for four factors. 
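
The correction can be sketched numerically for a single pair of tests. The sketch below assumes the four centroid loadings of tests 10 and 2 as they are listed in Table 6 of chapter iv; the function names are illustrative only and are not part of the text.

```python
import math

# Centroid loadings of tests 10 and 2 on four factors
# (typed in from Table 6 of chapter iv; an assumption of this sketch).
test_10 = [0.642, 0.443, -0.150, -0.107]
test_2 = [0.579, 0.499, -0.090, -0.057]


def scalar_product(a, b):
    """Scalar product of two trait vectors in the common-factor space."""
    return sum(x * y for x, y in zip(a, b))


def corrected_for_uniqueness(a, b):
    """Correlation divided by the product of the two vector lengths,
    i.e. the cosine of the angular separation of the two trait vectors."""
    r = scalar_product(a, b)
    h_a = math.sqrt(scalar_product(a, a))  # square root of the communality
    h_b = math.sqrt(scalar_product(b, b))
    return r / (h_a * h_b)


R_10_2 = corrected_for_uniqueness(test_10, test_2)
print(round(R_10_2, 3))
```

The augmented coefficient for this pair comes out near .99, so the two trait vectors are separated by only a small angle.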



[Table 26. Intercorrelations of the fifteen tests after correction for 
uniqueness. Rows and columns are in the order 10, 2, 5, 3, 4, 1, 8, 7, 9, 
6, 15, 14, 17, 11, 18.] 



CHAPTER IV 
THE PRINCIPAL AXES 

A method of locating the principal axes* 

It has been shown that a set of traits may be regarded as n points in a 
common-factor space of r dimensions. It has also been shown that by rota- 
tion of F there exists an infinite number of factorial matrices which repro- 
duce the correlations in R. It is natural to inquire whether a rotational cri- 
terion can be found by which a unique solution F may be obtained. One 
solution is to adopt the principal axes as the reference axes of F. The prin- 
cipal axes are defined as follows: 

Definition: If the sum of the squares of the projections of the test vectors 
on a radial axis is stationary, the axis is a principal axis. 

It can be shown that a set of vectors in a space of r dimensions has r prin- 
cipal axes and that these axes are orthogonal. 

The attempted solution to the factor problem by which the trait vectors 
are described in terms of their projections on the principal axes is erroneous 
in spite of the fact that it is of considerable analytical interest. It will be 
described here with numerical examples partly because of its analytical 
interest but mainly because it will be shown in chapter vii to be psycho- 
logically meaningful when it is used in a modified form. 

At the outset it may be stated that the method of principal axes does not 
give psychologically meaningful results. The matrix F which represents the 
principal axes of a battery has two serious limitations, namely, (a) the 
reference traits that are represented by the columns of F are a function of 
the number of traits of each kind that happen to be included in the battery, 
and (b) about half of the factor loadings beyond the first factor are 
necessarily negative. One of the fundamental requirements of a successful 
factorial method is that the factorial description of a trait must remain 
invariant when the trait is moved from one battery to another which involves 
the same common factors or abilities. When psychological tests are involved, 
a negative factor loading implies an ability whose possession is a detriment 
to the test performance. Such a situation can be comprehended for unusual 
situations, but it is not conceivable that half of the factor loadings in 
all special abilities should be negative. The reader may regard the method 
of principal axes as of analytical interest, but he should not expect to be 
able to give psychological meaning to the solution. A psychologically 
meaningful solution will be presented in chapter vii. 

* The method of principal axes was first described in a paper which I presented at the 
Syracuse meeting of the American Association for the Advancement of Science in 1932. 
It was published in "The Theory of Multiple Factors," pp. 17-27. The method is given 
here in notation that is consistent with that of the previous chapters. Hotelling's special 
case of the method was described by him in "Analysis of a Complex of Statistical 
Variables into Principal Components," Journal of Educational Psychology, Vol. XXIV 
(September and October, 1933). 

Each of the reference traits may be regarded as a unit vector in the same 
space of r dimensions in which the traits are represented by vectors whose 
scalars are less than unity. Let one such reference vector be $\Lambda_1$, 
and let its direction cosines be $\lambda_{11}, \lambda_{21}, \lambda_{31}, 
\ldots, \lambda_{r1}$ in the common-factor space. The unit reference vector 
$\Lambda_1$ may be thought of as representing an imaginary pure trait. The 
correlation between a trait j and the reference vector $\Lambda_1$ will 
then be 

(1)   $r_{j\Lambda_1} = a_{j1}\lambda_{11} + a_{j2}\lambda_{21} + \cdots + a_{jr}\lambda_{r1}$ , 

or 

(2)   $r_{j\Lambda_1} = \sum_{m=1}^{r} a_{jm}\lambda_{m1}$ . 

This correlation is the projection of the vector j on the unit reference 
vector $\Lambda_1$. 

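In order that the reference vector $\Lambda_1$ through the origin shall 
coincide with a principal axis of the system, it is necessary and sufficient 
that the sum of the squares of the projections of the trait vectors on the 
reference vector $\Lambda_1$ be stationary. We have then 

(3)   $r_{j\Lambda_1}^2 = a_{j1}\lambda_{11}\sum_{m=1}^{r} a_{jm}\lambda_{m1} + a_{j2}\lambda_{21}\sum_{m=1}^{r} a_{jm}\lambda_{m1} + \cdots + a_{jr}\lambda_{r1}\sum_{m=1}^{r} a_{jm}\lambda_{m1}$ , 

or 

(4)   $r_{j\Lambda_1}^2 = \sum_{M=1}^{r}\sum_{m=1}^{r} a_{jM}a_{jm}\lambda_{M1}\lambda_{m1}$ , 

where the subscripts M and m refer to factors. Summing for n traits, 

(5)   $\sum_{j=1}^{n} r_{j\Lambda_1}^2 = \sum_{j=1}^{n}\sum_{M=1}^{r}\sum_{m=1}^{r} a_{jM}a_{jm}\lambda_{M1}\lambda_{m1}$ . 

For convenience, let 

(6)   $\sum_{j=1}^{n} r_{j\Lambda_1}^2 = u$ . 

Then 

(7)   $u = \sum_{j=1}^{n}\sum_{M=1}^{r}\sum_{m=1}^{r} a_{jM}a_{jm}\lambda_{M1}\lambda_{m1}$ , 

(8)   $\dfrac{\partial u}{\partial \lambda_{M1}} = 2\sum_{j=1}^{n} a_{jM}\sum_{m=1}^{r} a_{jm}\lambda_{m1}$ , 

or 

(9)   $\dfrac{\partial u}{\partial \lambda_{M1}} = 2\sum_{m=1}^{r}\sum_{j=1}^{n} a_{jM}a_{jm}\lambda_{m1}$ . 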

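Since $\lambda_{11}, \lambda_{21}, \ldots, \lambda_{r1}$ are the direction 
cosines of the reference vector $\Lambda_1$ on the centroid axes, the 
solution is subject to the conditional equation, 

(10)   $v = \lambda_{11}^2 + \lambda_{21}^2 + \cdots + \lambda_{r1}^2 - 1 = 0$ . 

The constrained stationary values of u which satisfy the conditional 
equation (10) can be found by Lagrange's method of undetermined multipliers.* 
We have then the following (r+1) simultaneous equations. 

(11)   $\dfrac{\partial u}{\partial \lambda_{11}} + \beta\,\dfrac{\partial v}{\partial \lambda_{11}} = 0$ , 
       $\dfrac{\partial u}{\partial \lambda_{21}} + \beta\,\dfrac{\partial v}{\partial \lambda_{21}} = 0$ , 
       . . . . . 
       $\dfrac{\partial u}{\partial \lambda_{r1}} + \beta\,\dfrac{\partial v}{\partial \lambda_{r1}} = 0$ , 
       $\lambda_{11}^2 + \lambda_{21}^2 + \cdots + \lambda_{r1}^2 - 1 = 0$ . 

By means of these simultaneous equations the (r+1) unknowns $\lambda_{11}, 
\lambda_{21}, \ldots, \lambda_{r1}$ and $\beta$ may be found. The partial 
derivatives of v are of the form 

(12)   $\dfrac{\partial v}{\partial \lambda_{M1}} = 2\lambda_{M1}$ . 

* William F. Osgood, Advanced Calculus (New York: Macmillan Co., 1928), p. 180. 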
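
Substituting (9) and (12) in (11), we have 

(13) 
   $\lambda_{11}\Bigl(\sum_{j=1}^{n} a_{j1}^2 + \beta\Bigr) + \lambda_{21}\sum_{j=1}^{n} a_{j1}a_{j2} + \lambda_{31}\sum_{j=1}^{n} a_{j1}a_{j3} + \cdots + \lambda_{r1}\sum_{j=1}^{n} a_{j1}a_{jr} = 0$ , 

   $\lambda_{11}\sum_{j=1}^{n} a_{j2}a_{j1} + \lambda_{21}\Bigl(\sum_{j=1}^{n} a_{j2}^2 + \beta\Bigr) + \lambda_{31}\sum_{j=1}^{n} a_{j2}a_{j3} + \cdots + \lambda_{r1}\sum_{j=1}^{n} a_{j2}a_{jr} = 0$ , 

   $\lambda_{11}\sum_{j=1}^{n} a_{j3}a_{j1} + \lambda_{21}\sum_{j=1}^{n} a_{j3}a_{j2} + \lambda_{31}\Bigl(\sum_{j=1}^{n} a_{j3}^2 + \beta\Bigr) + \cdots + \lambda_{r1}\sum_{j=1}^{n} a_{j3}a_{jr} = 0$ , 

   . . . . . 

   $\lambda_{11}\sum_{j=1}^{n} a_{jr}a_{j1} + \lambda_{21}\sum_{j=1}^{n} a_{jr}a_{j2} + \lambda_{31}\sum_{j=1}^{n} a_{jr}a_{j3} + \cdots + \lambda_{r1}\Bigl(\sum_{j=1}^{n} a_{jr}^2 + \beta\Bigr) = 0$ . 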

The multipliers $\beta$ can be found from the fact that the determinant of 
the coefficients in (13) must vanish in order that solutions shall exist 
other than the trivial solution $\lambda_{11} = \lambda_{21} = \cdots = 
\lambda_{r1} = 0$. The expansion of the determinant of the coefficients in 
(13) gives the characteristic equation of degree r, which may be written as 
follows: 

(14)   $\beta^r + c_1\beta^{r-1} + c_2\beta^{r-2} + \cdots + c_r = 0$ . 

The coefficients c may be found by the following rules. 

The numerical term $c_r$ is the value of the rth order determinant of (13), 
ignoring $\beta$. The coefficient $c_{r-1}$ is the sum of all the 
(r-1)-rowed principal minors in the same determinant. The coefficient 
$c_{r-2}$ is the sum of all the (r-2)-rowed principal minors. The 
coefficient of $\beta^r$ is always unity. The coefficient $c_x$ in (14) is 
the sum of all the x-rowed principal minors. All of the roots of (14) are 
real and negative or zero. They may be designated $\beta_1, \beta_2, 
\ldots, \beta_r$. 

Each of the roots $\beta_p$ is substituted, in turn, in (13). Each of the r 
values of $\beta_p$ gives a set of direction cosines for a principal axis. 
When the root $\beta_p$ is substituted in (13), the solution gives the 
direction cosines of $\Lambda_p$, which are $\lambda_{1p}, \lambda_{2p}, 
\ldots, \lambda_{rp}$. 

The r principal axes $\Lambda$ are orthogonal. In fact, their direction 
cosines may be arranged to form the matrix of the transformation from the 
given orthogonal co-ordinates in F to those of the principal axes. The 
matrix of the transformation is as given in Table 1. 
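
It is of interest to note that each value $\beta_p$ is, with reversed sign, 
the sum of the squares of the projections of the trait vectors on the 
principal axis $\Lambda_p$. By inspection of the numerical values of the 
roots $\beta_p$, the major, mean, and minor axes of the system may be 
designated. 

Table 1 

         $\Lambda_1$      $\Lambda_2$      $\Lambda_3$      . . .   $\Lambda_r$ 
  1      $\lambda_{11}$   $\lambda_{12}$   $\lambda_{13}$   . . .   $\lambda_{1r}$ 
  2      $\lambda_{21}$   $\lambda_{22}$   $\lambda_{23}$   . . .   $\lambda_{2r}$ 
  3      $\lambda_{31}$   $\lambda_{32}$   $\lambda_{33}$   . . .   $\lambda_{3r}$ 
  .        .                .                .                        . 
  r      $\lambda_{r1}$   $\lambda_{r2}$   $\lambda_{r3}$   . . .   $\lambda_{rr}$ 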



A numerical example of the method of principal axes 

The method of principal axes consists in the rotation of the co-ordinate 
system of F so that the principal axes constitute the orthogonal axes of ref- 
erence. The principal axes may be defined as a set of orthogonal reference 
axes on each of which the sum of the squares of the projections of the trait 
vectors is stationary. 

Let Table 2 represent the factorial matrix F of a set of seven fictitious 
tests in three factors. Geometrically, this table shows the three orthogonal 
co-ordinates of seven points. The communalities are listed in a column 
separate from F. They are all less than unity to correspond to the fact that 
every mental test may be assumed to have a specific factor in any finite 
battery of tests. 

Table 2 

  Tests      I       II      III     $h^2$ 
    1       +.5     -.2     +.4      .45 
    2       +.6     -.2     +.6      .76 
    3       -.6     +.5     +.3      .70 
    4       -.3     +.4     +.6      .61 
    5       +.2     +.1     +.7      .54 
    6       +.6     -.4      .0      .52 
    7       +.7     -.3     +.5      .83 

The sums required for the characteristic equation are as follows: 

    $\sum_{j=1}^{n} a_{j1}^2 = +1.95$ ,        $\sum_{j=1}^{n} a_{j1}a_{j2} = -1.07$ , 

    $\sum_{j=1}^{n} a_{j2}^2 = +.75$ ,         $\sum_{j=1}^{n} a_{j1}a_{j3} = +.69$ , 

    $\sum_{j=1}^{n} a_{j2}a_{j3} = +.11$ ,     $\sum_{j=1}^{n} a_{j3}^2 = +1.71$ . 

These sums are substituted in the determinant of the characteristic 
equation as follows: 

(15)   $\begin{vmatrix} (+1.95+\beta) & -1.07 & +.69 \\ -1.07 & (+.75+\beta) & +.11 \\ +.69 & +.11 & (+1.71+\beta) \end{vmatrix} = 0$ . 



The values of $\beta_p$ must be such as to make the determinant of the 
coefficients of the three homogeneous equations vanish in order that 
non-trivial solutions for $\lambda_{1p}, \lambda_{2p}, \lambda_{3p}$ shall 
exist. 

Expanding the determinant of (15), we get an equation of the form 

(16)   $\beta^3 + c_1\beta^2 + c_2\beta + c_3 = 0$ . 

The numerical values of the coefficients of $\beta_p$ are determined by the 
rule previously given. The numerical value of $c_3$ is the value of the 
following determinant: 

    $\begin{vmatrix} +1.95 & -1.07 & +.69 \\ -1.07 & +.75 & +.11 \\ +.69 & +.11 & +1.71 \end{vmatrix}$ 

It is found to be zero. Hence the rank of the determinant is less than 3. 
This proves that Table 2 can be rotated so as to make at least one of its 
columns vanish. 

The value of $c_2$ is the sum of the three 2-rowed principal minors in (15), 
and $c_1$ is the sum of the three 1-rowed principal minors, which is merely 
the sum of the three diagonal terms. The coefficient of $\beta^3$ is unity. 
We then have the following values for the coefficients: 

    $c_3 = 0$ ,   $c_2 = +4.4464$ ,   $c_1 = +4.41$ . 

The expansion (16) can then be written as follows: 

(17)   $\beta^3 + 4.4100\beta^2 + 4.4464\beta + 0 = 0$ . 
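
The rule of principal minors can be sketched as follows. The matrix of sums is typed in from the determinant (15) with the multiplier ignored; the helper names `det` and `principal_minor_sum` are illustrative, and the recursive determinant is only meant for small tables such as this one.

```python
from itertools import combinations

# Sums of squares and cross products from (15), ignoring the multiplier.
S = [
    [1.95, -1.07, 0.69],
    [-1.07, 0.75, 0.11],
    [0.69, 0.11, 1.71],
]


def det(M):
    """Determinant by Laplace expansion along the first row."""
    if len(M) == 1:
        return M[0][0]
    total = 0.0
    for col, entry in enumerate(M[0]):
        minor = [row[:col] + row[col + 1:] for row in M[1:]]
        total += (-1) ** col * entry * det(minor)
    return total


def principal_minor_sum(M, size):
    """Sum of all principal minors with the given number of rows."""
    total = 0.0
    for idx in combinations(range(len(M)), size):
        sub = [[M[i][j] for j in idx] for i in idx]
        total += det(sub)
    return total


c1 = principal_minor_sum(S, 1)  # sum of the diagonal terms
c2 = principal_minor_sum(S, 2)  # sum of the three 2-rowed principal minors
c3 = principal_minor_sum(S, 3)  # the full determinant
print(round(c1, 4), round(c2, 4), round(c3, 4))
```

The vanishing of the last coefficient is what shows that the seven points lie in a plane through the origin.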

The roots of equation (17) are all real. One of the roots is zero. Dividing 
by $\beta$, we have the quadratic 

(18)   $\beta^2 + 4.4100\beta + 4.4464 = 0$ . 

The two roots of this equation are -2.849690 and -1.560310. Let these 
roots be designated by subscripts in the order of their numerical magnitude, 
namely, 

(19)   $\beta_1 = -2.849690$ , 
       $\beta_2 = -1.560310$ , 
       $\beta_3 = 0$ . 
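
The two non-zero roots follow from the ordinary quadratic formula. A minimal sketch, taking the coefficients of (18) as given:

```python
import math

# Coefficients of the quadratic (18): beta^2 + c1*beta + c2 = 0.
c1 = 4.41
c2 = 4.4464

discriminant = c1 * c1 - 4.0 * c2
beta_1 = (-c1 - math.sqrt(discriminant)) / 2.0  # larger numerical magnitude
beta_2 = (-c1 + math.sqrt(discriminant)) / 2.0

print(round(beta_1, 6), round(beta_2, 6))
```

The sum of the two roots is -4.41 and their product is 4.4464, which gives a quick check on the arithmetic.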



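Substituting $\beta_1$ in the three simultaneous equations whose 
coefficients are shown in (13), we get three simultaneous equations, 

(20)   $- .899690\lambda_{11} - 1.07\lambda_{21} + .69\lambda_{31} = 0$ , 
       $- 1.07\lambda_{11} - 2.099690\lambda_{21} + .11\lambda_{31} = 0$ , 
       $+ .69\lambda_{11} + .11\lambda_{21} - 1.139690\lambda_{31} = 0$ . 

Solving for the ratios of $\lambda_{11}, \lambda_{21}, \lambda_{31}$, and 
normalizing them so that 

(21)   $\lambda_{11}^2 + \lambda_{21}^2 + \lambda_{31}^2 = 1$ , 

we have 

(22)   $\lambda_{11} = + .804972$ , 
       $\lambda_{21} = - .386636$ , 
       $\lambda_{31} = + .450036$ , 
       $\sum_{m=1}^{3} \lambda_{m1}^2 = 1.000000$ . 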
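
One way to solve three homogeneous equations of rank 2 is to take the cross product of two of the rows and normalize it. The sketch below uses that device, which is a convenience for the three-dimensional case and is not claimed to be the computing routine of the text; the sign of the resulting unit vector is arbitrary and is chosen here so that the first cosine is positive.

```python
import math

# Sums of squares and cross products of the Table 2 loadings, as in (15).
S = [
    [1.95, -1.07, 0.69],
    [-1.07, 0.75, 0.11],
    [0.69, 0.11, 1.71],
]
beta_1 = -2.849690  # first root, from (19)


def direction_cosines(sums, beta):
    """Unit vector satisfying the homogeneous equations (13) for one root,
    valid for a 3 x 3 system whose coefficient table has rank 2."""
    A = [
        [sums[i][j] + (beta if i == j else 0.0) for j in range(3)]
        for i in range(3)
    ]
    u, v = A[0], A[1]
    # The cross product of two independent rows is orthogonal to both rows.
    w = [
        u[1] * v[2] - u[2] * v[1],
        u[2] * v[0] - u[0] * v[2],
        u[0] * v[1] - u[1] * v[0],
    ]
    length = math.sqrt(sum(x * x for x in w))
    w = [x / length for x in w]
    if w[0] < 0:  # arbitrary sign convention for this sketch
        w = [-x for x in w]
    return w


lam = direction_cosines(S, beta_1)
print([round(x, 6) for x in lam])
```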

These are the direction cosines of a unit reference vector which lies in the 
major principal axis of the system of seven points. 

The second root, $\beta_2$, is then substituted in (15), and the same 
procedure gives the following values for the direction cosines of the unit 
reference vector $\Lambda_2$, which lies in the mean principal axis of the 
system: 

(23)   $\lambda_{12} = - .257498$ , 
       $\lambda_{22} = + .455692$ , 
       $\lambda_{32} = + .852080$ , 
       $\sum_{m=1}^{3} \lambda_{m2}^2 = +1.000000$ . 



The third root, $\beta_3$, is zero. The fact that the third root vanishes 
means that the sum of the squares of the projections of the seven test 
vectors on the minor principal axis is zero. Hence the projection of each of 
the seven tests on that axis is zero. Substituting $\beta_3 = 0$ in (13), we 
obtain, by the same procedure as before, the values for the direction 
cosines of the unit reference vector $\Lambda_3$, which lies in the minor 
principal axis of the system. These values are as follows: 

(24)   $\lambda_{13} = - .534522$ , 
       $\lambda_{23} = - .801784$ , 
       $\lambda_{33} = + .267261$ , 
       $\sum_{m=1}^{3} \lambda_{m3}^2 = +1.000000$ . 

The direction cosines of the three principal axes of the system are 
arranged in Table 3 to form the orthogonal transformation of the original 
co-ordinates of the matrix in Table 2 to the co-ordinates that refer to the 
principal axes. This table corresponds to Table 1 for the general case. 

Table 3 

           $\Lambda_1$     $\Lambda_2$     $\Lambda_3$ 
    1      + .804972       - .257498       - .534522 
    2      - .386636       + .455692       - .801784 
    3      + .450036       + .852080       + .267261 

The fact that the new co-ordinates are orthogonal is verified by the fact 
that the matrix in Table 3 is orthogonal by columns. The correlations 
between the three principal axes can be expressed as follows: 

(25)   $r_{\Lambda_1\Lambda_2} = \lambda_{11}\lambda_{12} + \lambda_{21}\lambda_{22} + \lambda_{31}\lambda_{32}$ , 
       $r_{\Lambda_1\Lambda_3} = \lambda_{11}\lambda_{13} + \lambda_{21}\lambda_{23} + \lambda_{31}\lambda_{33}$ , 
       $r_{\Lambda_2\Lambda_3} = \lambda_{12}\lambda_{13} + \lambda_{22}\lambda_{23} + \lambda_{32}\lambda_{33}$ . 

Substituting the numerical values of Table 3 in (25), it is seen that the 
three intercorrelations are zero. Table 3 is the matrix of an orthogonal 
transformation. It must be orthogonal by rows and by columns; and its 
determinant must equal +1, since the matrix represents a rotation without 
reflection. These properties may be used as a check on the arithmetical work. 
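
The arithmetical check can be sketched as follows, with the entries of Table 3 typed in to six decimals. Because the entries are rounded, the products are compared with their ideal values only within a small tolerance; the variable names are illustrative.

```python
# Direction cosines of Table 3; columns are the three principal axes.
T = [
    [0.804972, -0.257498, -0.534522],
    [-0.386636, 0.455692, -0.801784],
    [0.450036, 0.852080, 0.267261],
]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


columns = [[T[i][p] for i in range(3)] for p in range(3)]

# Cross products of distinct columns and of distinct rows should vanish,
# and each column or row should have unit length.
column_products = [[dot(columns[p], columns[q]) for q in range(3)] for p in range(3)]
row_products = [[dot(T[i], T[k]) for k in range(3)] for i in range(3)]

# Determinant of a 3 x 3 matrix by direct expansion.
determinant = (
    T[0][0] * (T[1][1] * T[2][2] - T[1][2] * T[2][1])
    - T[0][1] * (T[1][0] * T[2][2] - T[1][2] * T[2][0])
    + T[0][2] * (T[1][0] * T[2][1] - T[1][1] * T[2][0])
)

print([[round(x, 5) for x in line] for line in column_products])
print(round(determinant, 5))
```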

It is now possible to write the rotated form of F with the principal axes 
as co-ordinate axes. The new co-ordinates are shown in Table 4. The third 
column vanishes because one of the roots of the characteristic equation is 
zero. The communalities are listed in a separate column. They remain 
invariant under rotation. 

Table 4 

  Test                           $r_{j\Lambda_1}$   $r_{j\Lambda_2}$   $r_{j\Lambda_3}$   $h^2$ 
    1                            + .659828          + .120945          .00                .45 
    2                            + .830332          + .265611          .00                .76 
    3                            - .541290          + .637969          .00                .70 
    4                            - .126124          + .770774          .00                .61 
    5                            + .437356          + .590526          .00                .54 
    6                            + .637638          - .336776          .00                .52 
    7                            + .904489          + .109084          .00                .83 
  $\sum_{j=1}^{n} r_{j\Lambda}^2$   2.849689           1.560312           .00               4.41 


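The three factor loadings of the first test in Table 4 are obtained from the 
following equations: 

(26)   $r_{1\Lambda_1} = a_{11}\lambda_{11} + a_{12}\lambda_{21} + a_{13}\lambda_{31} = + .659828$ , 
       $r_{1\Lambda_2} = a_{11}\lambda_{12} + a_{12}\lambda_{22} + a_{13}\lambda_{32} = + .120945$ , 
       $r_{1\Lambda_3} = a_{11}\lambda_{13} + a_{12}\lambda_{23} + a_{13}\lambda_{33} = .000000$ . 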
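
The same projection can be carried out for all seven tests at once. The sketch below assumes the loadings of Table 2 and the direction cosines of Table 3 as typed in; `project` is an illustrative name for the ordinary row-by-column product.

```python
# Loadings of Table 2 (columns I, II, III).
F = [
    [+0.5, -0.2, +0.4],
    [+0.6, -0.2, +0.6],
    [-0.6, +0.5, +0.3],
    [-0.3, +0.4, +0.6],
    [+0.2, +0.1, +0.7],
    [+0.6, -0.4, 0.0],
    [+0.7, -0.3, +0.5],
]

# Direction cosines of Table 3; column p holds the cosines of axis p.
T = [
    [0.804972, -0.257498, -0.534522],
    [-0.386636, 0.455692, -0.801784],
    [0.450036, 0.852080, 0.267261],
]


def project(loadings, transformation):
    """Projection of every test vector on every principal axis."""
    axes = range(len(transformation[0]))
    return [
        [sum(row[m] * transformation[m][p] for m in range(len(row))) for p in axes]
        for row in loadings
    ]


P = project(F, T)
column_sums = [sum(row[p] ** 2 for row in P) for p in range(3)]

print([round(x, 6) for x in P[0]])
print([round(x, 5) for x in column_sums])
```

The column sums of squares reproduce the roots of the characteristic equation with reversed sign, within the rounding of the six-decimal cosines.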



The intertest correlations of the seven tests may be obtained either from 
the three factor loadings of Table 2 or from the two factor loadings of 
Table 4. All of the intertest correlations are summarized in Table 5. 
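
That the two descriptions give the same correlations can be verified directly. The sketch assumes the three loadings of Table 2 and the two non-vanishing loadings of Table 4 as typed in; agreement is expected only within the rounding of the six-decimal entries.

```python
# Three-factor loadings of Table 2.
F = [
    [+0.5, -0.2, +0.4],
    [+0.6, -0.2, +0.6],
    [-0.6, +0.5, +0.3],
    [-0.3, +0.4, +0.6],
    [+0.2, +0.1, +0.7],
    [+0.6, -0.4, 0.0],
    [+0.7, -0.3, +0.5],
]

# Two-factor loadings on the major and mean principal axes (Table 4).
P = [
    [0.659828, 0.120945],
    [0.830332, 0.265611],
    [-0.541290, 0.637969],
    [-0.126124, 0.770774],
    [0.437356, 0.590526],
    [0.637638, -0.336776],
    [0.904489, 0.109084],
]


def correlations(loadings):
    """Scalar products of all pairs of test vectors; the diagonal cells
    hold the communalities."""
    n = len(loadings)
    return [
        [sum(a * b for a, b in zip(loadings[j], loadings[k])) for k in range(n)]
        for j in range(n)
    ]


r_three = correlations(F)
r_two = correlations(P)
largest_gap = max(
    abs(r_three[j][k] - r_two[j][k]) for j in range(7) for k in range(7)
)
print(round(r_three[0][1], 2), round(r_two[0][1], 2), largest_gap < 1e-5)
```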

It is of interest to note that the sums of the squares of the factor loadings 
in the columns of Table 4 are identical with the roots of the characteristic 
equation with reversed sign. 

The present problem was set up so that one of the values of $\beta$ would be 
zero. This was done by writing the loadings in Table 2 so that the seven 
points were in the same plane. The points all satisfy the equation of an 
arbitrarily chosen plane, namely, 2x + 3y - z = 0. In actual practice it is 
not likely that one of the values of $\beta$ will be zero, but it may be 
very nearly zero. The fact that one or more of the roots of the 
characteristic equation are zero proves that the tests may be described in 
terms of less than r common factors. These common factors may be chosen to 
be statistically independent or dependent. 

Table 5 

          1       2       3       4       5       6       7 
    1            +.58    -.28    +.01    +.36    +.38    +.61 
    2    +.58            -.28    +.10    +.52    +.44    +.78 
    3    -.28    -.28            +.56    +.14    -.56    -.42 
    4    +.01    +.10    +.56            +.40    -.34    -.03 
    5    +.36    +.52    +.14    +.40            +.08    +.46 
    6    +.38    +.44    -.56    -.34    +.08            +.54 
    7    +.61    +.78    -.42    -.03    +.46    +.54 

Hotelling's special case 

The method of principal axes is applicable for any diagonal values that 
preserve the Gramian properties of the correlation table. If reliabilities are 
recorded in the diagonal cells, or any other values greater than the com- 
munalities, the rank of the correlational matrix R will, in general, be equal 
to the number of tests. The centroid method will then give a factorial ma- 
trix F with as many columns as there are rows. This means that as many 
common factors are postulated as there are traits. Such a factorial matrix 
can be rotated by the method of principal axes so that the orthogonal ref- 
erence vectors lie in the principal axes of the system of n points. 

It has been shown in the previous chapter that in the special case where 
unity is recorded in the diagonals of the correlational matrix the centroid 
method gives a square factorial matrix F of n columns and n rows which 
reproduces exactly all of the experimentally obtained correlation 
coefficients in $R_0$. The co-ordinate axes of this matrix may also be 
rotated into the principal axes of the system. 

Hotelling has discussed this special case of the method of principal axes 
in which unity is recorded in the diagonals of $R_0$. He has called this 
special case the "method of principal components." The principal components 
are the projections of the trait vectors on the principal axes in the total 
factor space. He has described an ingenious iteration method by which the 
projections of the vectors on the principal axes in the total factor space 
may be found directly from the given coefficients in $R_0$. Unfortunately, 
this ingenious solution is not useful because it is subject not only to the 
fundamental limitations of the principal axes but also to additional 
limitations. The additional difficulties with Hotelling's case may be 
described as follows: To record unity in the diagonal cells of $R_0$ implies 
that the total variance of each trait is to be described by common factors. 
It has been shown that the intercorrelations of n traits can always be 
accounted for exactly by n common factors. This can be done with the 
diagonal method described in chapter ii, by the centroid method of chapter 
iii with unity in the diagonals, or by the principal axes method with unity 
in the diagonals. Any solution in which the intercorrelations of n tests are 
accounted for exactly by n common factors must be an artifact as far as the 
psychological problem is concerned, because it is definitely known that each 
test has some unique variance. Three sources of unique variance may be 
listed, namely, (a) the chance errors in the test scores, (b) the specific 
factor in each test, and (c) the sampling errors in the correlation 
coefficients. Hotelling's case assumes that the tests are free from chance 
errors in the scores, that specific factors are absent, and that sampling 
errors are absent. This may be seen by considering the fact that his 
procedure gives a factorial matrix of n common factors which accounts for 
the coefficients exactly without any specific or unique variance whatever. 
As far as the psychological problem is concerned, such a solution is not 
acceptable. 
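
An iteration of this general kind can be sketched with the ordinary power method: a trial vector is multiplied repeatedly by the correlation table and rescaled until it settles on the major principal axis. The sketch is not claimed to reproduce Hotelling's own computing scheme. It assumes the fictitious loadings of Table 2 for the off-diagonal correlations and, as in the special case under discussion, unity in the diagonal cells; the function name and the number of steps are arbitrary choices.

```python
import math

# Loadings of Table 2, used here only to generate the intercorrelations.
F = [
    [+0.5, -0.2, +0.4],
    [+0.6, -0.2, +0.6],
    [-0.6, +0.5, +0.3],
    [-0.3, +0.4, +0.6],
    [+0.2, +0.1, +0.7],
    [+0.6, -0.4, 0.0],
    [+0.7, -0.3, +0.5],
]
n = len(F)

# Correlation table with unity recorded in the diagonal cells.
R = [
    [1.0 if j == k else sum(a * b for a, b in zip(F[j], F[k])) for k in range(n)]
    for j in range(n)
]


def multiply(table, vector):
    return [sum(table[j][k] * vector[k] for k in range(len(vector))) for j in range(len(table))]


def major_axis(table, steps=500):
    """Power iteration: returns the largest latent root and its unit vector."""
    v = [1.0] * len(table)
    for _ in range(steps):
        w = multiply(table, v)
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    root = sum(a * b for a, b in zip(v, multiply(table, v)))
    return root, v


root, axis = major_axis(R)
# Projections of the seven trait vectors on the major axis of the total space.
loadings = [math.sqrt(root) * x for x in axis]
residual = max(abs(a - root * b) for a, b in zip(multiply(R, axis), axis))
print(round(root, 4), [round(x, 3) for x in loadings])
```

The sum of the squares of these projections equals the latent root, which here exceeds the corresponding value for the common-factor space because the unique variance of each test has been forced into the common factors.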

In addition to these difficulties there must be considered the difficulties 
of the general principal axes solution which have been described in this 
chapter. These apply also to Hotelling's case. It is, of course, desirable 
that the axes of reference in terms of which the tests are described shall 
have psychological or genetic meaning. Consider any single test, such as a 
test of numerical manipulation. If this test is included in a battery which 
contains only a few number tests but many verbal tests, it is clear that the 
major principal axis will pass closer to the verbal tests than to the number 
tests. Now consider the same test when it is placed in a battery which 
contains only a few verbal tests but many number tests. The major principal 
axis of this system will pass closer to the number tests. The factorial 
descriptions of the particular number test will be different in the two sets 
of computations, depending on the tests which are chosen arbitrarily for 
combination in a battery with the number test. It is not to be expected that 
such a factorial description should give psychologically meaningful axes of 
reference. This fundamental limitation is applicable also to the centroid 
method if any attempt is made to interpret the centroid co-ordinates 
directly without rotation. The purpose of the centroid method is merely to 
obtain a factorial matrix which accounts for the observed correlations 
within experimental errors and with the smallest possible number of common 
factors. The number of common factors is shown by the number of columns of F 
as found by the centroid method. Hotelling's iteration method might be used 
for the same purpose if it could be modified so as to use communalities 
instead of unity in the diagonals of $R_0$. As with the centroid 
co-ordinates, a further rotation would be necessary in order to obtain a 
stable and fundamentally significant factorial description of the tests. 
No method is acceptable in this problem which distorts the rank of the 
correlational matrix in the common-factor subspace. These considerations 
make it necessary to discard the method of principal axes and also 
Hotelling's special case of this method as solutions to the psychological 
factor problem. 

Table 6 

                    CENTROID CO-ORDINATES 
  Test         I           II          III         IV          $h^2$ 
   10        .642        .443       -.150       -.107        .6424 
    2        .579        .499       -.090       -.057        .5956 
    5        .561        .449       -.041       -.126        .5339 
    3        .712        .228        .092        .121        .5820 
    4        .633        .134        .061       -.076        .4281 
    1        .685        .159        .157        .224        .5693 
    8        .529       -.144        .207        .134        .3614 
    7        .559       -.146        .233        .077        .3940 
    9        .546       -.222        .162       -.257        .4397 
    6        .585       -.293        .274       -.208        .5464 
   15        .475       -.112       -.132        .083        .2625 
   14        .428       -.235       -.149       -.126        .2765 
   17        .619       -.303       -.194       -.071        .5176 
   11        .598       -.313       -.272        .259        .5966 
   18        .436       -.084       -.099        .111        .2193 
  $\sum a$     8.587       .060        .059       -.019       6.9653 
  $\sum a^2$   5.011317   1.183860     .428619     .341573    6.9654 

Table 7 

              1                  2                  3                  4 
    1    5.011317+$\beta$     .165747            .076365            .015854 
    2     .165747           1.183860+$\beta$    -.053695           -.040502 
    3     .076365            -.053695            .428619+$\beta$   -.044782 
    4     .015854            -.040502           -.044782            .341573+$\beta$ 

The principal axes of a battery of fifteen psychological tests 

In the previous chapter a battery of fifteen tests by Brigham was used as 
a numerical example of the centroid method of factoring the correlational 
matrix. The same data will here be used as a numerical example of the 
principal axes. Table 6 contains the first four columns of Table (25-iii) 
and also the communalities for the first four centroid factors. The 
resulting coefficients of (13) are shown in Table 7. The expanded form (14) 
is as follows: 

(27)   $\beta^4 + 6.965369\beta^3 + 10.810494\beta^2 + 5.407203\beta + .840052 = 0$ . 

The four roots of this equation are 

    $\beta_1 = -5.019711$ , 
    $\beta_2 = -1.182688$ , 
    $\beta_3 = - .444971$ , 
    $\beta_4 = - .317999$ . 

Substituting each of these four roots in (13) gives, after normalizing, the 
direction cosines of the four principal axes. These are listed in the four 
columns of Table 8. This table represents an orthogonal transformation 
$\Lambda$ by which F is rotated into the principal axes. The factorial 
matrix $F\Lambda$ is shown in Table 9. This matrix represents the same test 
configuration as the given Table 6. The only difference is that in Table 9 
the fifteen test vectors are described in terms of their projections on the 
principal axes, while in Table 6 they are described in terms of their 
projections on the arbitrary orthogonal axes of the centroid method. 

Table 8 

                 $\Lambda_1$   $\Lambda_2$   $\Lambda_3$   $\Lambda_4$ 
  $\lambda_1$      .999         -.041          .015         -.012 
  $\lambda_2$      .043          .996         -.047          .070 
  $\lambda_3$      .016         -.072         -.909          .411 
  $\lambda_4$      .003         -.045          .414          .909 

Table 9 

  Test        I          II         III        IV 
   10       .658       .431       .081      -.136 
    2       .598       .482       .043      -.061 
    5       .579       .433      -.028      -.107 
    3       .723       .186      -.034       .155 
    4       .639       .107      -.084      -.042 
    1       .694       .109      -.047       .271 
    8       .526      -.186      -.118       .190 
    7       .556      -.189      -.165       .149 
    9       .538      -.244      -.235      -.189 
    6       .576      -.326      -.313      -.104 
   15       .468      -.125       .167       .008 
   14       .415      -.235       .101      -.197 
   17       .602      -.310       .170      -.173 
   11       .580      -.328       .378       .095 
   18       .431      -.099       .146       .049 
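
The four roots can be checked against the coefficients of (27) by the usual relations between the roots and the coefficients of a polynomial. The sketch assumes the roots as quoted to six decimals, so the agreement is only approximate; the helper name is illustrative.

```python
from itertools import combinations
from math import prod

# Roots of the characteristic equation (27), to six decimals.
roots = [-5.019711, -1.182688, -0.444971, -0.317999]


def coefficient(roots, k):
    """Coefficient c_k of the monic polynomial with the given roots:
    (-1)^k times the sum of all products of k distinct roots."""
    return (-1) ** k * sum(prod(group) for group in combinations(roots, k))


c1, c2, c3, c4 = (coefficient(roots, k) for k in (1, 2, 3, 4))
print(round(c1, 6), round(c2, 4), round(c3, 4), round(c4, 4))
```

The first coefficient is also the sum of the diagonal terms of Table 7, that is, the sum of the fifteen communalities.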



CHAPTER V 
THE SPECIAL CASE OF RANK ONE 

The intercolumnar criterion 

The case of rank 1 is of special interest because it is the case to which 
Spearman and his students have given so much study. This is also the case 
which has been the subject of controversy during the past thirty years. 
Practically all of the scientific publications on the factor problem have been 
restricted to Spearman's special case of rank 1. It is only within the last 
few years that the more general case of the factor problem has been studied 
in which the rank exceeds 1 and in which any number of factors are treated 
analytically. Now that the factor problem has been generalized a step be- 
yond the case of Spearman, it is of some interest to interpret a few of the 
old issues in a new light. The single-factor methods of Spearman may be 
interpreted in terms of the matrix formulation of the factor problem. 

One of the earliest methods of Spearman was to ascertain the correlation 
between pairs of columns of R. Ideally, this correlation should be unity if 
the given correlations can be accounted for by a single common factor. It is 
a well-known property of determinants that if the rank is 1, then the 
columns are proportional and hence the intercolumnar correlations are unity. 
This property is stated in the following theorems. 

Theorem 1. If the correlational matrix is of rank 1, then all pairs of 
columns, or rows, are proportional. 

The converse of this theorem is also true, for if all pairs of columns are 
proportional, then all minors of second order or higher vanish, and hence 
the rank must be less than 2. The trivial case of rank 0 is here of no 
significance. The case in which the rank is 0 is, of course, identified by 
the fact that all the intercorrelations are 0. That is a case of no 
scientific interest. We have, therefore, the following converse theorem: 

Theorem 2. If all pairs of columns of the correlational matrix are 
proportional, then the rank of the matrix is 1 or 0. 

If a pair of columns are proportional, then the correlation between the 
columns is, of course, +1, so that we have the following theorem: 

Theorem 3. If the correlational matrix is of rank 1, then the correlation 
between any pair of columns is +1 or -1. 

The converse of Theorem 3 is not necessarily true. A specific case which 
disproves the converse is as follows : Let the coefficients in a pair of columns 
be such that when one is plotted against the other, a linear plot is obtained 
which does not pass through the origin. The correlation would be +1, but 
the coefficients in the two columns would not be proportional. 
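
A numerical instance of this counterexample can be sketched as follows. The two columns are hypothetical figures chosen only so that one is a linear function of the other with a non-zero intercept; they are not taken from any correlation table in the text.

```python
import math


def pearson(x, y):
    """Product-moment correlation of two equally long lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical entries of two columns: a linear plot that misses the origin.
column_k = [0.2, 0.3, 0.4, 0.5]
column_l = [0.5 * value + 0.1 for value in column_k]

correlation = pearson(column_k, column_l)
ratios = [b / a for a, b in zip(column_k, column_l)]

print(round(correlation, 6))
print([round(value, 3) for value in ratios])
```

The correlation is unity although the ratios of corresponding entries vary from row to row, so the columns are not proportional.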

Spearman's former use of the intercolumnar criterion depended on the 
converse of Theorem 3, in that the high correlation between columns was 
the basis for the inference that a single factor was sufficient to describe the 
intercorrelations, i.e., that the rank of $R_0$ was 1 within sampling errors. 
While the intercolumnar criterion is demonstrably fallible, it should be use- 
ful for rank 1, because it would be a rare situation in which a set of mental 
tests would satisfy the criterion when the rank was higher than 1. 

Another type of difficulty appeared with the intercolumnar criterion. If 
all the coefficients in $R_0$ are of the same order of magnitude and if these are 
overlaid with sampling errors, then the dispersion of a column may be com- 
parable with the sampling errors, and the correlation between columns may 
be low because of the restricted range of the entries in the correlation table. 
The proportionality would still be maintained within sampling errors, but 
the points in the correlation table would be so restricted in range that the 
correlation coefficient would not reveal the proportionality. The intercolum- 
nar proportionality criterion is therefore superior to the intercolumnar corre- 
lation criterion. 

The limiting case of this effect is of some interest. If all of the 
coefficients in a correlation table are equal, then the proportionality 
criterion is satisfied but the correlation coefficient is indeterminate. The 
proportionality criterion would give the correct inference, namely, that the 
correlation matrix is of rank 1. We have then 

(1)   $r_{jk} = a_{j1}a_{k1}$ . 

But 

(2)   $r_{jk} = r_{jl} = r_{kl}$ , 

and hence 

(3)   $a_{j1}a_{k1} = a_{j1}a_{l1} = a_{k1}a_{l1}$ , 

(4)   $a_{j1} = a_{k1} = a_{l1}$ . 

It follows from (1) and (4) that 

(5)   $a_{j1} = a_{k1} = a_{l1} = \sqrt{r_{jk}}$ . 

This limiting case is represented in the following theorem: 

Theorem 4. If all of the coefficients $r_{jk}$ in a correlational matrix are 
equal, then the matrix is of rank 1 and each test has a single factor 
co-ordinate of $\sqrt{r_{jk}}$. 

If sampling errors are superimposed on this limiting case, the correlation 
between columns shows only the correlation between random errors. This 
correlation should be 0 or near 0. The intercolumnar proportionality 
criterion is still valid, and it would be only slightly affected by the 
sampling errors in a finite test battery. 

Spearman's use of the correlational, rather than the proportionality, 
form of the intercolumnar criterion was determined, probably, by the fact 
that the standard error of a correlation coefficient can be determined, 
whereas the proportionality form of the criterion would require the de- 
velopment of an appropriate standard error formula. There does not seem 
to be any fundamental difficulty in doing so. 

Graphical method for rank 1 

The object of the factor problem is to find the factorial matrix which for 
rank 1 is a single column containing the one factor loading or co-ordinate 
for each test. The previous theorems suggest a simple graphical method of 
examining R_0. If the columns are proportional, then the plot of any column
against any other column is linear through the origin. Let k and l designate
any two columns, and let j designate any row of R_0. Then if r_{jk} is plotted
against r_{jl}, the linear plot should be of the form

(6) r_{jk} = c r_{jl} ,

where c is the slope constant. Substituting (1) in (6),

(7) a_{j1} a_{k1} = c a_{j1} a_{l1} ,

or

(8) c = \frac{a_{k1}}{a_{l1}} .

Since c is the slope of the plot, we have the following theorem:

Theorem 5. If the rank of the correlational matrix is 1, and if any column k
is plotted against any other column l, then the plot is linear through the
origin with a slope which is the ratio of the single-factor loading of test
k to that of l.

Even if the tests are overlaid with sampling errors, this ratio is quite stable, 
and it may be determined by any suitable method of curve fitting. The 
simplest method is probably the method of averages in which a line is drawn
through the origin and through the centroid of the plot. If the coefficients
deviate appreciably from 0, the slope of this line is not markedly affected by
sampling errors. Hence this simple method should be useful in examining a
table for rank 1. While this method is useful for examining a correlational 
matrix, it is not recommended for obtaining the single-factor loadings. A 
simpler and more direct method of solving the single-common-factor prob- 
lem is described later in this chapter. 
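
The method of averages can be sketched numerically. The following Python fragment is only an illustration: the five loadings are hypothetical values chosen for the example, and the function name centroid_slope is an assumption of this sketch. It builds two columns of a rank 1 table and confirms that the line through the origin and the centroid has the slope stated in Theorem 5.

```python
def centroid_slope(col_k, col_l):
    """Slope of the line through the origin and the centroid of the plot
    of column k (vertical axis) against column l (horizontal axis)."""
    mean_k = sum(col_k) / len(col_k)
    mean_l = sum(col_l) / len(col_l)
    return mean_k / mean_l


# Hypothetical single-factor loadings for five tests (illustrative only).
loadings = [0.8, 0.7, 0.6, 0.5, 0.4]
k, l = 0, 1  # the two columns that are compared

# Use only rows j other than k and l, so that no diagonal entry is needed.
rows = [j for j in range(len(loadings)) if j not in (k, l)]
col_k = [loadings[j] * loadings[k] for j in rows]
col_l = [loadings[j] * loadings[l] for j in rows]

slope = centroid_slope(col_k, col_l)
expected = loadings[k] / loadings[l]  # ratio of loadings, as in Theorem 5
print(slope, expected)
```

With observed coefficients the two columns would be read from the correlation table instead of being generated from known loadings.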

If the plot is not linear, or if the points scatter badly, the correlation 
table is of rank higher than 1, and Spearman's single-factor methods do not 
apply. If single-factor methods are to be used, the next step would be, no 
doubt, to try to find a subgroup which would give a linear plot through the 
origin. The intercorrelations of such a subgroup of tests could be accounted 
for by a single factor. 

The tetrad difference 

Spearman's present method is to evaluate what are called "tetrad differ- 
ences." The tetrad difference is of the form 

(9) r_{km} r_{ln} - r_{lm} r_{kn} = p ,

where k and l refer to two rows, while m and n refer to two columns of R_0.
The four subscripts refer to as many tests, and it is implied that four separate
tests are involved in the tetrad-difference equation. Hence the tetrad
difference is not written so as to include any diagonal terms of R_0. This is
consistent with the fact that the communalities are unknown. Spearman
has shown that if only one factor is involved, then all the tetrad differences
in R_0 vanish.

The tetrad differences have a very simple matrix interpretation. They 
are simply the expansions of second-order minors in the correlation table. 
If the rank of R_0 is 1, then all second-order minors vanish. The converse is
also true, for if the second-order minors vanish, then the rank must be 1, 
except for the trivial case when all entries are 0. The matrix interpretation 
of Spearman's tetrad-difference procedure is that rank 1 (i.e., a single com- 
mon factor) is established by evaluating separately the second-order minors 
in the correlational matrix. One might speculate as to whether multiple- 
factor analysis would have developed earlier if this interpretation had been 
stated before. If the second-order minors must vanish in order to establish 
a single common factor, then must the third-order minors vanish in order
to establish two common factors, and so on? To have put the matter in 
this way would have led to the matrix formulation of the problem. 
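
The identification of tetrads with second-order minors can be checked on small fictitious examples. In the sketch below, written in Python, the two factorial matrices are hypothetical and the function names are assumptions of the illustration. For each set of four tests the three tetrad differences are formed from the three products of off-diagonal coefficients, so that no diagonal term is used.

```python
from itertools import combinations


def correlations_from(F):
    """Return F F' for a factorial matrix given as a list of rows."""
    return [[sum(x * y for x, y in zip(fj, fk)) for fk in F] for fj in F]


def tetrad_differences(R, a, b, c, d):
    """The three tetrad differences of four distinct tests a, b, c, d.
    Each one is a second-order minor that avoids the diagonal."""
    p1 = R[a][b] * R[c][d]
    p2 = R[a][c] * R[b][d]
    p3 = R[a][d] * R[b][c]
    return (p1 - p2, p1 - p3, p2 - p3)


def largest_tetrad(R):
    """Largest absolute tetrad difference in the whole table."""
    return max(
        abs(t)
        for four in combinations(range(len(R)), 4)
        for t in tetrad_differences(R, *four)
    )


# Hypothetical loadings: one common factor, then two common factors.
one_factor = [[0.8], [0.7], [0.6], [0.5], [0.4]]
two_factor = [[0.8, 0.0], [0.7, 0.0], [0.0, 0.6], [0.0, 0.5], [0.5, 0.5]]

largest_one = largest_tetrad(correlations_from(one_factor))
largest_two = largest_tetrad(correlations_from(two_factor))
print(largest_one, largest_two)
```

The first value is zero apart from rounding, while the second is not, which is the behavior described in the text for rank 1 and for higher rank.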

To establish that a matrix is of any particular rank r, it is of course 
necessary to prove that r is the highest order of the non-vanishing minors.
Taken literally, this requires that all minors of order higher than r must be 
shown to be 0; but for computational purposes this is probably the most 
awkward way possible, especially for the single-common-factor case. 

The tetrad-difference method of examining a correlation table cannot be 
recommended even for the restricted single-common-factor case to which 
it is theoretically applicable. The reason is that more effective methods are 
available for ascertaining whether one common factor is sufficient to ac- 
count for the intercorrelations. If more than one factor is required, then the 
tetrad-difference criterion is not applicable. Some of the properties of the 
tetrad differences will be described here because of the fact that this way 
of ascertaining whether a correlation table is of rank 1 is in general use. 
There is considerable interest in the tetrads among students of factor theory. 

Because of the great amount of labor that is involved in the computation
of the tetrads for a large correlation table, it is convenient to know how 
many tetrads must be evaluated for n tests in order to cover the whole 
table. Since a tetrad difference is the value of a second-order minor which 
does not contain diagonal terms, there are as many tetrads as there are 
second-order minors which do not involve the diagonals. Each of these 
minors is defined by two rows and two columns. The number of pairs of 
rows that can be taken is the number of combinations of n things taken two 
at a time, or 

(10) C_2^n = \frac{n(n-1)}{2} .

The number of possible pairs that can be taken from the remaining columns,
since diagonal elements are excluded, is then

(11) C_2^{n-2} = \frac{(n-2)(n-3)}{2} .

Hence the total number of second-order minors in the correlation table, excluding
diagonals, is

(12) C_2^n C_2^{n-2} = \frac{n(n-1)(n-2)(n-3)}{4} .

But since the correlational table is symmetric, it follows that every one of
these minors is duplicated by a symmetric minor of the same value. Hence
the total number of different tetrads is

(13) \frac{1}{2} C_2^n C_2^{n-2} = \frac{n(n-1)(n-2)(n-3)}{8} .

Since every set of four variables gives three tetrads, it is possible to obtain
the same result by considering the number of combinations of n things taken
four at a time. Then the number of tetrads is

(14) 3 C_4^n = \frac{n(n-1)(n-2)(n-3)}{8} .

Example : If the number of tests is 20, then the correlation table contains 
14,535 tetrads. 
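
The two ways of counting can be compared directly. The short Python sketch below uses the standard-library function math.comb; the two function names are assumptions of the illustration.

```python
from math import comb


def tetrad_count(n):
    """Pairs of rows times pairs of remaining columns, halved for symmetry."""
    pairs_of_rows = comb(n, 2)
    pairs_of_columns = comb(n - 2, 2)
    return pairs_of_rows * pairs_of_columns // 2


def tetrad_count_by_fours(n):
    """Three tetrads for every set of four tests."""
    return 3 * comb(n, 4)


print(tetrad_count(20), tetrad_count_by_fours(20))
```

Both counts agree with the closed form n(n-1)(n-2)(n-3)/8.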

When the computation of all these tetrads has been made, the result is 
usually that the tetrads do not vanish. The inference must then be made 
that one common factor is insufficient to account for the intercorrelations 
of the tests. The question as to which of the tetrads will vanish and which 
of them will not vanish, and the question whether one common factor is 
sufficient, can be answered more easily by the other methods of this chapter. 

If it is found that the tetrads do vanish within the sampling errors, then 
the next problem is to ascertain how much of the variance of each test is 
attributable to the single common factor. This can be done in terms of the 
correlation coefficients, as follows: 

Consider the single-factor expression for the intercorrelations of any three
tests j, k, and l. Then

(15a) r_{jk} = a_{j1} a_{k1} ,

(15b) r_{jl} = a_{j1} a_{l1} ,

(15c) r_{kl} = a_{k1} a_{l1} .

In order to find the loading a_{j1} of test j with the single common factor,
divide (15b) by (15c). Then

(16) \frac{r_{jl}}{r_{kl}} = \frac{a_{j1}}{a_{k1}} ,

so that

(17) a_{k1} = a_{j1} \frac{r_{kl}}{r_{jl}} .

Substituting (17) in (15a),

(18) r_{jk} = a_{j1}^2 \frac{r_{kl}}{r_{jl}} ,

from which we have Spearman's formula* for the correlation of test j with
the single common factor, namely,

(19) a_{j1} = \sqrt{\frac{r_{jk} r_{jl}}{r_{kl}}} .



* C. Spearman, The Abilities of Man (New York: Macmillan Co., 1927), Appendix, 
p. xvi, equation (19). 
The value of (19) is subject to fluctuation with the sampling errors of the
three coefficients in terms of which it is expressed. It is desirable to minimize
this effect by taking an average value for a_{j1}, based on different pairs of
tests k and l with which test j is combined. With n tests, the number of
ways in which (19) can be written for test j is the number of pairs of tests
that may be taken from the remaining (n - 1) tests, excluding test j. Hence
the number of ways in which (19) may be written is

C_2^{n-1} = \frac{(n-1)(n-2)}{2} .

Since there are n tests, the total number of formulae (19) for ascertaining
the single-factor loading of all the tests is \frac{1}{2} n(n-1)(n-2).

Example: In order to ascertain the correlation of each of 20 tests with
the single common factor by all the determinations in R_0, formula (19)
would be evaluated 3,420 times.
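
The averaging of formula (19) over all pairs of tests k and l may be sketched as follows. The Python fragment is an illustration only: the loadings are hypothetical, the function name averaged_loading is an assumption, and the coefficients are taken to be positive so that the square root is real.

```python
from itertools import combinations
from math import sqrt


def averaged_loading(R, j):
    """Average of formula (19) over all pairs (k, l) that exclude test j.
    Returns the average and the number of determinations used."""
    others = [t for t in range(len(R)) if t != j]
    estimates = [
        sqrt(R[j][k] * R[j][l] / R[k][l]) for k, l in combinations(others, 2)
    ]
    return sum(estimates) / len(estimates), len(estimates)


# Hypothetical single-factor loadings and the table they generate.
true_loadings = [0.8, 0.7, 0.6, 0.5, 0.4]
n = len(true_loadings)
R = [[true_loadings[j] * true_loadings[k] for k in range(n)] for j in range(n)]

results = [averaged_loading(R, j) for j in range(n)]
total_evaluations = sum(count for _, count in results)
print(results, total_evaluations)
```

For five tests the formula is evaluated 30 times in all, which agrees with n(n-1)(n-2)/2; only off-diagonal coefficients enter the computation.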

Spearman's procedure takes into consideration that the tetrad difference 
p in (9) does not quite vanish because of sampling errors in the four corre- 
lation coefficients. If a single common factor is fundamentally present and 
if the four coefficients have known standard errors, an expression for the 
standard error of p can be derived. This has been done by Wishart and by 
Holzinger.* The experimentally observed deviations of p from zero should 
not exceed those which might be expected from the standard errors of p. 
This is the central idea in Spearman's single-common-factor method. The 
tetrads in a correlation table are first evaluated. A frequency distribution 
of these tetrad differences is then made and its standard deviation deter- 
mined. If this dispersion is of an order of magnitude comparable with that 
which would be expected from the known standard errors of the tetrad 
differences, then Spearman draws the legitimate conclusion that a single 
common factor is sufficient to account for the observed intercorrelations.
Applications of formulas of the type (19) give the loading of each test with 
the single common factor whose sufficiency has been established by the 
fact that the tetrads vanish within sampling errors. 

It must be borne in mind that the vanishing of the tetrads, i.e., rank 1, 
does not prove the existence of a single common factor in the sense of a 
mental ability or in a genetic sense. This can be seen by considering the 
case where the r factor loadings are in the same proportion in all of the 
tests. All of the test vectors are then collinear in a common-factor space of 

* John Wishart, "Sampling Errors in the Theory of Two Factors," British Journal 
of Psychology, XIX (1928), 180-87. 

Karl J. Holzinger, Statistical Resume of the Spearman Two-Factor Theory (Chicago: 
University of Chicago Press, 1930), pp. 6-16. 
r dimensions, although their scalars may be different because of differences
in the specific variances and in the error variances of the tests. This con- 
tingency is illustrated by the following fictitious factorial matrix of five 
tests and two factors. 

If the matrix F of Table 1 is multiplied by its transpose F', it will be seen 
that the columns in R are proportional and that it is of rank 1. The tetrads 
vanish, and the intercorrelations of the tests can be described as well by 
one factor, as shown in the single-column matrix of Table 2. This example 
violates the postulate on page 57 in chapter i. 

Table 1

        I      II
1      .60    .30
2      .40    .20
3      .80    .40
4      .20    .10
5      .30    .15

Table 2

        I
1      .6708
2      .4472
3      .8944
4      .2236
5      .3354
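
The relation between the two tables can be verified by direct multiplication. The following Python sketch takes the entries of Table 1, forms F F', and compares it with the table generated by a single column; the variable names are assumptions of the illustration.

```python
from math import sqrt

# Factorial matrix F of Table 1: five tests, two factors.
F = [[0.60, 0.30], [0.40, 0.20], [0.80, 0.40], [0.20, 0.10], [0.30, 0.15]]

# R = F F', including the communalities in the diagonal.
R = [[sum(x * y for x, y in zip(fj, fk)) for fk in F] for fj in F]

# A single column whose entries are the lengths of the test vectors.
single = [sqrt(R[j][j]) for j in range(len(F))]

# Largest difference between R and the table generated by the single column.
largest_difference = max(
    abs(R[j][k] - single[j] * single[k])
    for j in range(len(F))
    for k in range(len(F))
)
print([round(a, 4) for a in single], largest_difference)
```

The single column agrees with Table 2 to four decimals, and the difference between the two tables of correlations is zero apart from rounding.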



The reason why this result is obtained is that Table 1 is of rank 1. It cor- 
responds to the conceivable psychological situation in which each test of a 
battery calls for two primary mental abilities in the same ratio although 
they differ in specificity and reliability. In practice, it is possible to select 
from a large table of tests several groups whose intercorrelations are high 
when corrected for uniqueness. Each one of these groups of tests can be 
described in terms of one factor, but that factor is not necessarily psycho- 
logically significant. The tests may be composites, as illustrated in Tables 1 
and 2. One way of avoiding this ambiguity is to work with several abilities
or factors simultaneously, as is done in the multiple-factor methods, and to 
insure that a large number of zeros occur in each column of the table. This 
is one of the fundamental ideas developed in the subsequent chapters. 

In order to reduce the labor of computing probable errors of the tetrads, 
Spearman and his students have developed several abbreviated procedures. 
These are all limited, however, to the single-common-factor case. 

Graphical analysis of tetrads 

Although the tetrad method of examining a correlation table is not rec- 
ommended, there is still so much interest in tetrads among students of fac- 
torial analysis that a simple graphical method of selecting the vanishing and 
the non-vanishing tetrads will be described. By Theorem 5, a plot of any 
column k against any other column l is linear if the rank of R_0 is 1, i.e., if the
intertest correlations can be described by a single common factor. If the 
test battery as a whole cannot be described by a single common factor, the 
plot will not be linear but will scatter, as shown in Figure 1. This figure
shows the plot of column 9 against column 1 from Table II of a recent factorial
investigation by William Brown and William Stephenson.* Column 9
represents a test of pattern perception, and column 1 represents a test of
inventive synonyms.

[Figure 1. Plot of column 9 against column 1; horizontal axis: Test 1.]

Although it is evident in Figure 1 that a single factor
is not sufficient to account for the intercorrelations and that, therefore, all 
of the tetrads do not vanish, it is still possible for smaller groups of tests to 
be of such a character that their intercorrelations can be described by a 
single common factor. Their tetrads should then vanish. The smallest group 
for which a tetrad can be written is four tests. Two tests are represented by 
the two columns that are plotted. Any pair of points on the diagram determines
a tetrad. If they lie in a radial line through the origin, the corresponding
tetrad vanishes. If they do not lie in a radial line through the
origin, the corresponding tetrad does not vanish.

* "A Test of the Theory of Two Factors," British Journal of Psychology (General Section), XXIII (1933), 352-70.

Let two points on the diagram represent tests m and n. If the two points
are radial, then

(20) \frac{r_{km}}{r_{lm}} = \frac{r_{kn}}{r_{ln}} ,

so that

(21) r_{km} r_{ln} - r_{lm} r_{kn} = 0 .



If the points m and n are not radial, the proportionality of (20) does not ob- 
tain and the tetrad (21) does not vanish. 

A few numerical examples will be shown from Figure 1. A radial line 
can be drawn through the points 12 and 19. Hence the tetrad determined 
by these two points vanishes. The tetrad is as follows: 

r_{12,1} r_{19,9} - r_{12,9} r_{19,1} = (.345)(.549) - (.401)(.489) = -.007 .



A radial line cannot be drawn through the points 4 and 5, and hence the cor- 
responding tetrad does not vanish. The tetrad is as follows: 

r_{41} r_{59} - r_{49} r_{51} = (.656)(.655) - (.516)(.373) = +.237 .
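
The arithmetic of these two tetrads can be repeated in a few lines. The Python sketch below uses only the eight coefficients quoted above; the function name tetrad and its argument order are assumptions of the illustration and follow the form of equation (9).

```python
def tetrad(r_km, r_ln, r_lm, r_kn):
    """Tetrad difference r_km * r_ln - r_lm * r_kn, as in equation (9)."""
    return r_km * r_ln - r_lm * r_kn


# Points 12 and 19, which lie nearly on one radial line.
radial = tetrad(0.345, 0.549, 0.401, 0.489)

# Points 4 and 5, which do not lie on one radial line.
non_radial = tetrad(0.656, 0.655, 0.516, 0.373)

print(round(radial, 3), round(non_radial, 3))
```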

It may be of some interest to examine the tests, which are as follows : 
1) Synonyms, 

4) Disarranged sentences, 

5) Fitting shapes, 

9) Pattern perception. 

This group of four tests probably has a common visual-perception factor in 
tests 5 and 9 which is not identical with the verbal factors in 1 and 4. 
The previous group is as follows: 
1) Synonyms, 
9) Pattern perception, 
12) Mutilated pictures, 
19) Arithmetical equations. 

Here there is either one common factor or several common factors whose 
factor loadings are in the same proportions in the four tests. It is probably 
significant that the battery contains only one arithmetical test. Any num- 
ber factor in this test will remain specific in the battery as a whole, and 
hence it can have no effect on the vanishing of the tetrads.

The graphical method which has been described here can be used to indicate,
without numerical calculation, which of the tetrads will vanish almost
exactly and which of them will have large residuals. It is possible that this 
graphical method could be extended so as to represent the probable errors 
of the tetrads; but since the tetrad method is not recommended even for the 
single-common-factor case, such a development would not be useful. 

A single-factor method without tetrads 

If a correlation table is of rank 1 the single-common-factor loading of 
each test may be determined by a summation procedure. This method is 
simpler than the tetrad method for a single common factor, and it gives 
more information about the variables than the tetrad differences give. 

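If the correlations r_{jk} can be described by a single common factor, we have

(1) r_{jk} = a_{j1} a_{k1} .

The sum of column k in the correlation table is

(22) \sum_{j=1}^{n} r_{jk} = a_{k1} \sum_{j=1}^{n} a_{j1} .

Since the diagonal terms are unknown, the summation in the left member
of (22) is unknown, and hence not suitable for computing purposes. Let
(r)_k be the sum of column k, omitting the unknown diagonal entry. Then

(23) (r)_k = \sum_{j=1}^{n} r_{jk} - a_{k1}^2 ,

and hence

(24) (r)_k = a_{k1} \sum_{j=1}^{n} a_{j1} - a_{k1}^2 .

Summing for all given coefficients in R_0 and omitting the unknown diagonals,
we have

(25) \sum_{k=1}^{n} (r)_k = \sum_{k=1}^{n} a_{k1} \sum_{j=1}^{n} a_{j1} - \sum_{k=1}^{n} a_{k1}^2 ,

or

(26) (r)_t = \left[ \sum_{j=1}^{n} a_{j1} \right]^2 - \sum_{j=1}^{n} a_{j1}^2 ,

where (r)_t denotes the sum of all the coefficients in R_0 except the diagonal
ones.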
Each of the two terms in the right member of (26) may be expressed in
terms of summations of known coefficients, as follows:

(1) r_{jk} = a_{j1} a_{k1} ,

and hence

(27) a_{j1} = \frac{r_{jk}}{a_{k1}} .

Summing for column k,

(28) \sum_{j=1}^{n} a_{j1} = \frac{1}{a_{k1}} \sum_{j=1}^{n} r_{jk} ,

and from (23), it follows that

(29) \sum_{j=1}^{n} a_{j1} = \frac{1}{a_{k1}} \left[ (r)_k + a_{k1}^2 \right] ,

or

(30) \sum_{j=1}^{n} a_{j1} = \frac{(r)_k}{a_{k1}} + a_{k1} .

Hence the first term of the right member of (26) is

(31) \left[ \sum_{j=1}^{n} a_{j1} \right]^2 = \frac{(r)_k^2}{a_{k1}^2} + 2(r)_k + a_{k1}^2 .

The last term of (26) may be expressed in terms of summations of coefficients,
as follows:

(1) r_{jk} = a_{j1} a_{k1} ,

so that

(32) r_{jk}^2 = a_{j1}^2 a_{k1}^2 ,

and hence

(33) a_{j1}^2 = \frac{r_{jk}^2}{a_{k1}^2} .

Summing for column k,

(34) \sum_{j=1}^{n} a_{j1}^2 = \frac{1}{a_{k1}^2} \sum_{j=1}^{n} r_{jk}^2 .

Summing for column k, except for the entry in row k,

(35) \sum_{j \ne k} a_{j1}^2 = \frac{1}{a_{k1}^2} \sum_{j \ne k} r_{jk}^2 .

Let the sum of the squares of the known coefficients in column k be denoted
by (r^2)_k, so that

(36) (r^2)_k \equiv \sum_{j \ne k} r_{jk}^2 .

Then (35) can be written in the form

(37) \sum_{j=1}^{n} a_{j1}^2 = \frac{(r^2)_k}{a_{k1}^2} + a_{k1}^2 .

Substituting (31) and (37) in (26),

(38) (r)_t = \frac{(r)_k^2}{a_{k1}^2} + 2(r)_k + a_{k1}^2 - \frac{(r^2)_k}{a_{k1}^2} - a_{k1}^2 ,

or

(39) (r)_t - 2(r)_k = \frac{(r)_k^2}{a_{k1}^2} - \frac{(r^2)_k}{a_{k1}^2} ,

from which it follows that

(40) a_{k1}^2 = \frac{(r)_k^2 - (r^2)_k}{(r)_t - 2(r)_k} ,

where

(r)_k \equiv the sum of the coefficients in column k, omitting the
unknown diagonal entry,

(r)_k^2 \equiv the square of (r)_k,

(r^2)_k \equiv the sum of the squares of the known coefficients
in column k,

(r)_t \equiv the sum of all known coefficients in the correlation
table.

Formula (40) gives the single-common-factor loading of each test in a
correlation table of rank 1. This formula has been given, in different nota- 
tion, by Spearman.* In order to ascertain how well the single common fac- 
tor accounts for the intercorrelations, a table of residuals should be com- 
puted. These residuals are defined as follows : 

(41) r_{jk} - a_{j1} a_{k1} = p_{jk} ,

where p_{jk} is the discrepancy between the given coefficient and that which is
determined by the single-common-factor loadings of tests j and k. In order 
to ascertain which tests deviate most from the single-common-factor hy- 
pothesis for the whole battery, the mean absolute discrepancy for each test 
might be determined. It would be denoted

(42) \frac{1}{n-1} \sum_{j \ne k} | p_{jk} |

for column k. This absolute mean discrepancy is a direct measure of the
agreement between the given coefficients and the hypothesis of a single 
common factor. Hence it is superior to the tetrad differences, which con- 
stitute an indirect measure of the agreement. 

A frequency distribution of the residuals may be made, and its dispersion 
may be compared with the standard error of the mean coefficient. A more 
formal treatment of the data would be to determine

(43) E_{jk} = \frac{p_{jk}}{\sigma_{r_{jk}}}

for each coefficient, where E_{jk} is the ratio of the residual to the standard error
of the given coefficient. A frequency distribution of these ratios should have 
a standard deviation not appreciably greater than unity in order to estab- 
lish a single common factor as sufficient to account for the given correlation 
coefficients. 

* C. Spearman, The Abilities of Man (New York: Macmillan Co., 1927), Appendix,
p. xvi, equation (21).

Numerical example of single-factor method

An application of formula (40) has been made to a problem which has
been used by several students of factor theory. Table 3 contains the intercorrelations
of eight tests which are reproduced from Holzinger.* At the
bottom of the table are recorded the three entries which are required by
formula (40). The single-common-factor loading for each test is the last
entry in each column. In Table 4 are shown the residuals of (41). It will be
seen that they are small. At the bottom of each column of Table 4 is recorded
the absolute mean discrepancy of (42). This example illustrates a single-factor
method which does not require the computation of any tetrad differences.
Each of the entries in Table 4 might be divided by the standard
error of the corresponding correlation coefficient, and a frequency distribution
of these ratios might be prepared. Its standard deviation should not
be much greater than unity.

Table 3

                1         2         3         4         5         6         7         8
1                        .58       .58       .51       .48       .43       .43       .46
2              .58                 .47       .50       .53       .40       .44       .45
3              .58       .47                 .51       .48       .41       .45       .38
4              .51       .50       .51                 .34       .50       .35       .40
5              .48       .53       .48       .34                 .33       .41       .39
6              .43       .40       .41       .50       .33                 .41       .33
7              .43       .44       .45       .35       .41       .41                 .31
8              .46       .45       .38       .40       .39       .33       .31

(r)_k         3.47      3.37      3.28      3.11      2.96      2.81      2.80      2.72
(r)_k^2    12.0409   11.3569   10.7584    9.6721    8.7616    7.8961    7.8400    7.3984
(r^2)_k     1.7447    1.6443    1.5628    1.4183    1.2864    1.1489    1.1358    1.0756
a_{k1}^2   .585677   .546265   .512004   .451027   .401892   .356995   .354345   .331384
a_{k1}     .765295   .739097   .715545   .671585   .633950   .597491   .595269   .575660

Table 4
First-Factor Residuals for Table 3

                  1        2        3        4        5        6        7        8
1                       +.014    +.032    -.004    -.005    -.027    -.026    +.019
2              +.014             -.059    +.004    +.061    -.042     .000    +.025
3              +.032    -.059             +.029    +.026    -.018    +.024    -.032
4              -.004    +.004    +.029             -.086    +.099    -.050    +.013
5              -.005    +.061    +.026    -.086             -.049    +.033    +.025
6              -.027    -.042    -.018    +.099    -.049             +.054    -.014
7              -.026     .000    +.024    -.050    +.033    +.054             -.033
8              +.019    +.025    -.032    +.013    +.025    -.014    -.033

Mean |p_{jk}|   .018     .029     .031     .041     .041     .043     .031     .023

* Karl J. Holzinger, Statistical Resume of the Spearman Two-Factor Theory (Chicago:
University of Chicago Press, 1930), Table 6, p. 32.
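
The computations of this example may be retraced with a short program. The Python sketch below enters the coefficients of Table 3 above the diagonal, applies formula (40) to each column, and then forms the residuals of (41) and their mean absolute value for each column. The variable names are assumptions of the illustration, and no diagonal entries are used.

```python
from math import sqrt

# Coefficients of Table 3 above the diagonal, row by row.
upper = [
    [0.58, 0.58, 0.51, 0.48, 0.43, 0.43, 0.46],
    [0.47, 0.50, 0.53, 0.40, 0.44, 0.45],
    [0.51, 0.48, 0.41, 0.45, 0.38],
    [0.34, 0.50, 0.35, 0.40],
    [0.33, 0.41, 0.39],
    [0.41, 0.33],
    [0.31],
]
n = len(upper) + 1

# Fill the symmetric table; diagonal cells stay None because they are unknown.
R = [[None] * n for _ in range(n)]
for j, row in enumerate(upper):
    for offset, r in enumerate(row):
        k = j + 1 + offset
        R[j][k] = R[k][j] = r


def known(k):
    """Known coefficients of column k (diagonal omitted)."""
    return [R[j][k] for j in range(n) if j != k]


col_sum = [sum(known(k)) for k in range(n)]                  # (r)_k
col_sq = [sum(r * r for r in known(k)) for k in range(n)]    # (r^2)_k
total = sum(col_sum)                                         # (r)_t

# Formula (40) for each test.
loadings = [
    sqrt((col_sum[k] ** 2 - col_sq[k]) / (total - 2 * col_sum[k]))
    for k in range(n)
]

# Residuals of (41) and the mean absolute residual of each column.
mean_abs = [
    sum(abs(R[j][k] - loadings[j] * loadings[k]) for j in range(n) if j != k)
    / (n - 1)
    for k in range(n)
]
print([round(a, 6) for a in loadings])
print([round(m, 3) for m in mean_abs])
```

The loadings agree with the last row of Table 3 and the mean absolute residuals with the last row of Table 4, apart from differences in the final decimal that may arise from rounding.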



CHAPTER VI 

PRIMARY TRAITS 
Simple structure 

It has been shown that when the inequality (5 ii) is satisfied, there is a 
unique configuration of trait vectors that corresponds to the given correla- 
tional matrix. The correspondence between the configuration and the cor- 
relational matrix is independent of rotation of the orthogonal reference axes. 
The cell entries of F are altered within the range ±h_j under rotation of the
reference axes. Since the rotation of the reference axes is arbitrary, it is 
clear that the numerical values in F can have no direct interpretation except 
in terms of some criterion which relates the configuration uniquely to the 
reference axes. If such a criterion can be found which satisfies the demands 
of the scientific problem, the reference axes become unique in relation to the 
configuration instead of remaining arbitrary under rotation. 
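
The independence of the correlational matrix from rotation can be illustrated in two dimensions. In the Python sketch below the factorial matrix, the angle of rotation, and the function names are all hypothetical; the only point is that F F' is unchanged while the cell entries of F are altered, each remaining within the length h_j of its trait vector.

```python
from math import cos, sin, sqrt


def correlations(F):
    """Return F F' for a factorial matrix given as a list of rows."""
    return [[sum(a * b for a, b in zip(fj, fk)) for fk in F] for fj in F]


def rotate(F, angle):
    """Apply an orthogonal rotation of the two reference axes to F."""
    c, s = cos(angle), sin(angle)
    return [[x * c - y * s, x * s + y * c] for x, y in F]


# Hypothetical factorial matrix of four traits and two factors.
F = [[0.6, 0.3], [0.2, 0.7], [0.5, 0.5], [0.8, 0.1]]
G = rotate(F, 0.4)

R_F, R_G = correlations(F), correlations(G)
n = len(F)

max_change_R = max(abs(R_F[j][k] - R_G[j][k]) for j in range(n) for k in range(n))
max_change_F = max(abs(F[j][m] - G[j][m]) for j in range(n) for m in range(2))

# Length h_j of each trait vector, which bounds every entry of its row.
h = [sqrt(R_F[j][j]) for j in range(n)]
within_range = all(abs(G[j][m]) <= h[j] + 1e-12 for j in range(n) for m in range(2))
print(max_change_R, max_change_F, within_range)
```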

The multiple-factor problem can be stated in two parts, namely: 

1) What is the minimum number of factors that will account for the 
observed intercorrelations? 

2) What is the minimum number of factors for each trait that will ac- 
count for the intercorrelations? 

The solution to the first of these two problems has been described in the 
previous chapters. The solution to the second problem will supply a unique 
factorial matrix. 

The second problem is, in effect, to find the matrix representation of the 
simplest order among the traits that will account for the intercorrelations 
within the general restriction that the trait measures shall be linear func- 
tions of the several factors. If the traits involve r factors, the most complex 
order admissible in factor theory is that in which every one of the n traits 
involves every one of the r factors. The simplest possible order is that in 
which each trait can be described in terms of the smallest possible number 
of factors. The simplest order has been found when each row of the factorial 
matrix has at least one zero and when the number of zeros has been maxi- 
mized. 

It will be convenient to designate by special names some of the concepts 
that are involved in the problem of finding a unique factorial matrix of 
simplest possible order. The configuration of n trait vectors has r dimen- 
sions. The trait vectors are described in the matrix F by r orthogonal ref- 
erence vectors. 
[Table 1. Factorial matrix of nine traits and three factors (I, II, III); crosses mark the non-zero entries.]



Definition: The r orthogonal reference vectors that are implied in the fac- 
torial matrix F will be referred to as the orthogonal reference vectors. 
Definition: When the factorial matrix F has been computed by the centroid
method, its orthogonal reference vectors will be called the centroid reference
vectors.

Definition: The unique configuration of trait vectors defined by the correla- 
tional matrix will be called the correlational configuration or the trait 
configuration. 

Hence the factorial matrix F describes the trait configuration in terms of 
the orthogonal reference vectors. 

Definition: The combined configuration of the n trait vectors and any set of r
reference vectors will be called a structure.

Hence a structure is itself a configuration.

If the numerical entries in the factorial matrix shall have scientific in- 
terpretation, the reference vectors must have meaning beyond that of an 
arbitrary reference frame for the trait configuration. Each 
reference vector must be interpreted as a scientific cate- 
gory, so that the numerical entries of the factorial matrix 
have scientific meaning with reference to explanatory or 
descriptive categories. It is in this sense that the com- 
bined configuration of the trait vectors and the reference 
vectors constitutes a structure. 

Definition: A structure in which each trait vector is con- 
tained in one or more of the r orthogonal co-ordinate 
hyperplanes will be called an orthogonal simple 
structure. 

It follows from this definition that if a factorial matrix 
represents an orthogonal simple structure, each row of the 
matrix must have at least one zero. If r=3, each trait 
vector lies in at least one of the three orthogonal co-or- 
dinate planes. 
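
The necessary condition just stated is easily tested for a given factorial matrix. The Python sketch below is an illustration under stated assumptions: both matrices are hypothetical, the function name is invented for the example, and an entry is treated as zero when it is smaller in absolute value than a chosen tolerance. The condition is necessary only; it does not by itself show that the number of zeros has been maximized.

```python
def each_row_has_a_zero(F, tol=1e-9):
    """True if every row of the factorial matrix F has at least one
    entry that is zero within the tolerance tol."""
    return all(any(abs(a) <= tol for a in row) for row in F)


# Hypothetical matrix of nine traits and three factors in which every
# trait is described by one or two factors.
simple = [
    [0.7, 0.0, 0.0],
    [0.0, 0.6, 0.0],
    [0.0, 0.0, 0.8],
    [0.5, 0.4, 0.0],
    [0.6, 0.0, 0.3],
    [0.0, 0.5, 0.5],
    [0.4, 0.6, 0.0],
    [0.3, 0.0, 0.7],
    [0.0, 0.7, 0.2],
]

# Hypothetical matrix in which every trait involves all three factors.
complex_order = [[0.5, 0.4, 0.3], [0.2, 0.6, 0.4], [0.3, 0.3, 0.7]]

print(each_row_has_a_zero(simple), each_row_has_a_zero(complex_order))
```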

The nature of the second principal factor problem will 
be illustrated in three dimensions. Let Table 1 represent 
a factorial matrix of rank 3. The crosses in the cells represent
finite values of a_{jm}, while the other cells have zero entries. Some of
the traits can be described in terms of only one factor, while the others have 
two factors each. 

The corresponding configuration is shown in Figure 1, in which the three 
columns of the table are represented by the three reference vectors. The 
nine trait vectors are indicated by number. The trait configuration may be 
described numerically by reference to any arbitrarily chosen set of co-ordinate
axes. In general, an arbitrary set of axes is represented by a finite positive
or negative value of a_{jm} in each cell of F. Orthogonal simple structure
is shown if a set of axes can be found so that a large number of zeros appear
in F, with at least one zero in each row. Since each one of the traits can
be described in terms of fewer than three factors, simple structure can be 
found for the nine vectors of this example. 




Figure 1

If an nXr factorial matrix is set up with arbitrary entries in all cells, 
there is, in general, no transformation by which each of the n variables can 
be described in terms of fewer than r factors. It is assumed that n is large 
in comparison with r. Therefore the appearance of simple structure in a 
factorial matrix derived from observation commands attention. It is not a 
chance matter. When found in experimental data, it reveals order within 
the n variables in that r<n categories are required for describing them col- 
lectively and fewer than r categories are required for each one of them sepa- 
rately. When simple structure has been found by an orthogonal transforma- 
tion L upon F, the reference vectors of FL represent fundamental categories 
which must be incorporated into the ideal constructs of the science. 

Definition: If an underlying physical order of the n traits is such that each 
of the traits can be described in terms of a smaller number of factors than 
are required for describing the traits collectively, then the underlying 
physical order will be called a simple order. 
It is here assumed that for the purposes of any particular scientific study a
trait is completely described by that part of its total variance which is
represented by the observed intercorrelations. Factors additional to those 
which are represented by the rank of the correlational matrix may be in- 
volved in the specific variance of each trait, but these are irrelevant in a 
factorial scientific study of the traits in which the correlational matrix is 
used as a datum. 

Definition: If a simple order exists for a set of n traits and if the r factors are
statistically independent in the experimental population, then the corresponding
physical order will be called an orthogonal simple order.

Hence the configuration which represents an orthogonal simple order among
the n traits is an orthogonal simple structure. An order among the traits
involves, of course, not only the traits themselves but also the categories in
terms of which they are described. These categories are themselves traits
which may or may not be experimentally isolable.

It is useful to summarize the several fundamental concepts that are in- 
volved in this analysis. The concept order refers to the relation between the 
traits and the categories in terms of which the traits are to be described 
and comprehended. The correlational matrix describes merely the relations 
among the traits, independently of the descriptive categories. The factorial 
matrix describes the traits or variables in terms of an arbitrary set of de- 
scriptive categories. The trait configuration is a geometrical representation 
of the correlational matrix, and hence it is also independent of the descrip- 
tive categories. The structure is a configuration which represents not only 
the traits but also the arbitrary descriptive categories. The scientific prob- 
lem is essentially a search for a set of descriptive categories in terms of 
which our conception of the traits or variables shall be the simplest possible. 
If an overdetermined and unique simplicity in our conception can be 
achieved, then the traits or variables will be said to reveal a simple order. 
The search for these categories has its direct analytical counterpart in the 
search for a set of reference vectors which shall reveal a simple structure. A 
simple structure is a configurational representation of a simple order. If the 
simplifying descriptive categories happen to be statistically independent in 
the experimental population, then the trait configuration can be so rotated 
in its arbitrary orthogonal frame that each trait vector is contained in one 
or more of the r orthogonal co-ordinate hyperplanes. The result is an or- 
thogonal simple structure, and the reference vectors represent a set of sta- 
tistically independent traits that serve the simplest possible comprehension 
of the given traits or variables. 

Oblique reference vectors 

It is desirable to define the primary traits so as to describe the trait cor- 
relations in terms of the simplest possible order. Since the primary traits 
may not be orthogonal in the experimental population, the structure of the 
factorial matrix can be simplified by introducing oblique reference vectors. 
In the r dimensions of the common-factor space, each of r oblique reference 
vectors defines a co-ordinate hyperplane of $(r-1)$ dimensions. 

Definition: The subspace of $(r-1)$ dimensions which is orthogonal to the 
reference vector $\Lambda_p$ will be called the co-ordinate hyperplane $L_p$. 

The oblique reference vectors will be referred to merely as reference vectors. 
Orthogonality will be explicitly designated. 

The concept of orthogonal simple structure can be generalized to oblique 
reference axes. 

Definition: If a set of r hyperplanes of dimensionality $(r-1)$ exists such 
that each trait vector is in one or more of the hyperplanes, then the com- 
bined configuration of the trait vectors and the reference vectors will be 
called a simple structure or an oblique simple structure. 

The factorial matrix which describes a simple structure of n traits in terms 
of r oblique reference vectors will have at least one zero in each row. Since 
its reference vectors are oblique, it cannot be obtained from F by an orthog- 
onal transformation. A factorial matrix with oblique reference vectors will 
be denoted V. 

The r reference vectors which are implied by the columns of F are orthog- 
onal. Rotation of the system by the transformation L produces the matrix 
FL, which describes the same configuration. The r columns of FL also repre- 
sent orthogonal reference vectors. Instead of subjecting F to an orthogonal 
transformation L in the attempt to find simple structure in FL, the new 
reference vectors will here be regarded as oblique. Then the factorial 
matrix F is subjected to some transformation G, not necessarily orthogonal, 
in the attempt to find simple structure in 

(1)    $$FG = V .$$

The only restriction upon the transformation G is that it shall be normal- 
ized by columns, i.e., that the sum of the squares of the elements in each 
column shall equal unity. 

Each column of G represents the direction cosines of a reference vector. 
Since F is of rank r, it follows that G will be a square matrix of order r. Let 
$\Lambda_p$ be the reference vector represented by column p in G, and let its direc- 
tion cosines be $\lambda_{1p}, \lambda_{2p}, \ldots, \lambda_{rp}$. The corresponding column in V contains 
the cell entries $v_{jp}$, where 

(2)    $$v_{jp} = \sum_{m=1}^{r} a_{jm} \lambda_{mp} .$$

Each entry $v_{jp}$ is the scalar product of the test vector j and the reference 
vector $\Lambda_p$. Simple structure of V is shown if a transformation G can be 
found such that at least one of the entries $v_{jp}$ vanishes in each row j. Each 
hyperplane $L_p$ is determined by $(r-1)$ points and the origin. Hence there 
must be at least r trait vectors in each hyperplane in order that it shall be 
overdetermined. 

It will be useful to designate by a special name the number of reference 
vectors that are involved in the linear description of each trait. 

Definition: The number of reference vectors that are involved in the linear 
description of a trait will be called the complexity of the trait. 

It follows from this definition that in a simple structure every trait is of 
complexity less than r. 

Uniqueness of simple structure 

When reference axes have been found which produce a simple structure, 
it is of considerable scientific interest to know whether the simple structure 
is unique. Consider a set of six trait vectors in three dimensions in which 
no three of the vectors are coplanar. There are fifteen different simple 
structures for this configuration. If there are seven trait vectors in three 
dimensions, no three of which are coplanar, then a simple structure is im- 
possible because there exists no set of reference vectors, either orthogonal or 
oblique, by which a simple structure can be made. 
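
The two counts can be checked by direct enumeration. The following Python sketch assumes, as in the text, that no three trait vectors are coplanar, so that a plane through the origin holds at most two of them and each candidate plane can be identified by a pair of traits. The function name is hypothetical.

```python
from itertools import combinations

def count_simple_structures(n_traits):
    """Count sets of three planes that together contain all trait vectors.

    Assumes three dimensions and general position: no three trait vectors
    are coplanar, so each plane through the origin holds at most two traits.
    """
    planes = list(combinations(range(n_traits), 2))   # one candidate plane per pair
    count = 0
    for chosen in combinations(planes, 3):            # unordered sets of three planes
        covered = set()
        for plane in chosen:
            covered.update(plane)
        if len(covered) == n_traits:                  # every trait lies in some plane
            count += 1
    return count

print(count_simple_structures(6))  # 15
print(count_simple_structures(7))  # 0
```

The fifteen structures are the fifteen ways of dividing six vectors into three pairs. With seven vectors, three planes can hold at most six of them, so no simple structure exists.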

The necessary and sufficient conditions for uniqueness of a simple struc- 
ture need to be investigated. This is an important problem, because only in 
terms of its solution will it be possible to ascertain to what extent a particu- 
lar simple structure is overdetermined by the experimental data on which 
it is based. In the absence of a complete solution to this problem three cri- 
teria will here be listed which are almost certain to constitute sufficient and 
more than necessary conditions for the uniqueness of a simple structure. 
The scientific interpretation of the cell entries in the oblique factorial 
matrix V should not be attempted except after reasonable assurance that 
the simple structure of V is unique. It is part of the faith of science that if 
several alternative simple structures can be found for the matrix V and if 
each of them can be given plausible descriptive categories, only one of the 
alternatives can eventually remain acceptable. 

The three criteria by which the r reference vectors can be overdeter- 
mined are as follows: 

1) Each row of V should have at least one zero, 

2) Each column of V should have at least r zeros, 

3) For every pair of columns of V there should be at least r traits whose 
entries $v_{jp}$ vanish in one column but not in the other. 

The first criterion demands that each trait should be describable in 
terms of fewer categories than are required by the whole set of n traits. It 
is conceivable that, in some experimental work, one or more of the traits 
will be so complex as to require description in terms of all of the factors that 
enter into the traits collectively. For the purpose of isolating the funda- 
mental categories, these traits are not useful, and they should therefore be 
ignored. The criterion demands that the list of traits be long enough so 
that after elimination of several traits of complexity r, enough traits of com- 
plexity less than r remain to determine uniquely both the trait configura- 
tion and the simple structure. This principle may be illustrated with psy- 
chological tests. If one of the abilities to be isolated should be number 
sense, then this primary ability should not be required in all of the tests of 
a battery. The same restriction applies to each of the abilities that are to 
be isolated. 

The second criterion seems to be essential for the following reason. Each 
column p of V is determined by a hyperplane $L_p$. A hyperplane through the 
origin is determined by $(r-1)$ trait vectors. These trait vectors are con- 
tained in $L_p$, and therefore they have vanishing entries $v_{jp}$ in column p. 
Therefore there must be at least $(r-1)$ traits with vanishing entries in each 
column of V in order that the hyperplanes shall be determined. Since the 
hyperplanes should be overdetermined by the data, it follows that the num- 
ber of vanishing entries in each column of V should equal or exceed r. 

The third criterion is suggested by the fact that the r hyperplanes must 
be distinct. If two columns of V contain the same vanishing entries and if 
these exceed $(r-2)$ in number, then the two corresponding hyperplanes are 
identical. The third criterion was written so as to insure both overdeter- 
mination and distinctness of the hyperplanes that define the columns of V. 
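
The three criteria lend themselves to a mechanical check. The Python sketch below is one possible reading, not a procedure given in the text. It assumes that an entry counts as vanishing when its absolute value does not exceed a tolerance `tol`, and it reads the third criterion as requiring at least r traits that vanish in exactly one column of each pair. The function name, the tolerance, and the two small matrices are hypothetical.

```python
def check_criteria(V, tol=0.05):
    """Return three booleans, one for each criterion, for a matrix V (list of rows)."""
    n, r = len(V), len(V[0])
    zero = [[abs(v) <= tol for v in row] for row in V]   # pattern of vanishing entries
    # 1) every row has at least one vanishing entry
    c1 = all(any(row) for row in zero)
    # 2) every column has at least r vanishing entries
    c2 = all(sum(zero[j][p] for j in range(n)) >= r for p in range(r))
    # 3) every pair of columns has at least r traits vanishing in exactly one of them
    c3 = all(
        sum(zero[j][p] != zero[j][q] for j in range(n)) >= r
        for p in range(r)
        for q in range(p + 1, r)
    )
    return c1, c2, c3

good = [[0.7, 0.0], [0.5, 0.0], [0.0, 0.6], [0.0, 0.4]]
bad = [[0.7, 0.2], [0.5, 0.0], [0.0, 0.6], [0.0, 0.4]]
print(check_criteria(good))  # (True, True, True)
print(check_criteria(bad))   # (False, False, True)
```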

Primary trait vectors 

If a set of r hyperplanes has been found such that each trait vector is 
contained in one or more of the hyperplanes, then the n traits can be com- 
prehended as a simple order in which each trait is of complexity less than r. 
This implies that each row of the factorial matrix V has at least one zero. 
These hyperplanes will be specially designated as follows: 

Definition: The r hyperplanes whose normals produce a simple structure 
with a trait configuration will be called the co-ordinate hyperplanes 
for the trait configuration. 

Any other set of co-ordinate hyperplanes will be called arbitrary co-ordinate 
hyperplanes. The simple structure is defined by the trait configuration and 
the normals $\Lambda_p$ to the co-ordinate hyperplanes $L_p$. 

The corresponding co-ordinate axes are defined as follows: 

Definition: The intersection of any $(r-1)$ co-ordinate hyperplanes defines 
a co-ordinate axis of the structure. 

The total number of sets of $(r-1)$ hyperplanes is r, and consequently their 
intersections define r co-ordinate axes. These are of special scientific inter- 
est because they define the descriptive categories of the simple order. 

Definition: The unit vector defined by a co-ordinate axis will be called a 
primary trait vector or a primary vector. 

Definition: The trait which corresponds to a primary vector will be called a 
primary trait or a primary factor. 

The object of a factorial analysis is to discover the primary traits and to 
describe them in terms of the traits that are experimentally observed. 

A simple structure is represented diagrammatically in Figure 2 for three 
dimensions. The hyperplanes of dimensionality $(r-1)$ are planes in this 
special case. They are shown by the arcs $L_p$. The normals $\Lambda_p$ are also 
shown. These are the reference vectors. Each primary vector $T_p$ is the in- 
tersection of the $(r-1)$ hyperplanes $L_q$, where $q \neq p$. In three dimensions 
there are three planes $L_p$ which contain the origin. Their intersections de- 
termine the primary vectors $T_p$. 

The intersection of all the co-ordinate hyperplanes, excepting $L_p$, defines 
a primary trait vector which will be denoted $T_p$. Hence $T_p$ defines the linear 
subspace which is common to all the hyperplanes, excepting $L_p$. The trait 
vector $T_p$ is not contained in the hyperplane $L_p$, but it is contained in all the 
other hyperplanes. It follows that the primary trait $T_p$ is absent in all of the 
traits which have vanishing entries $v_{jp}$ in column p of V. The primary trait 
$T_p$ is present in all traits that have non-vanishing entries $v_{jp}$ in column p 
of V. 

Since the primary trait vector $T_p$ is not contained in the hyperplane $L_p$, 
it might be inferred that it is identical with the normal to the hyperplane 
$L_p$. Such is not necessarily the case. If the primary traits are statistically 
independent in the experimental population, then the vectors $T_p$ are orthog- 
onal, and so are also the co-ordinate hyperplanes $L_p$ and their normals $\Lambda_p$. 
In this case the reference vectors $\Lambda_p$ are identical with the primary vectors 
$T_p$. However, if the primary traits $T_p$ are not statistically independent in 
the experimental population, then the hyperplanes $L_p$ are oblique, and their 
normals $\Lambda_p$ are oblique. The two sets of vectors $\Lambda_p$ and $T_p$ are then, in gen- 
eral, distinct. 




FIGURE 2 

The geometrical interpretation of primary traits may be illustrated in 
three dimensions. Let the entries $a_{jm}$ in each row of F be augmented by the 
multiplier $1/h_j$. The geometric representation of the augmented co-ordi- 
nates is that each trait vector is extended to unit length. The augmented 
co-ordinates are therefore the direction cosines of the trait vectors. The 
termini of the trait vectors can be represented as points on the surface of a 
hypersphere. If r=3, the trait configuration can be studied graphically on 
the surface of a ball. 
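
The augmentation can be illustrated numerically. The Python sketch below scales one row of F to unit length. The row is that of test 10 in Table 2 of this chapter, and the function name is hypothetical.

```python
import math

def augment(row):
    """Scale one row of F to unit length.

    Returns the length h of the trait vector and its direction cosines.
    """
    h = math.sqrt(sum(a * a for a in row))
    return h, [a / h for a in row]

h, cosines = augment([0.642, 0.443, -0.150])   # centroid co-ordinates of test 10
print(round(h * h, 4), [round(c, 3) for c in cosines])
# 0.6309 [0.808, 0.558, -0.189]
```

The square of the length is the communality for the three factors, and the direction cosines agree with the last three columns of Table 2.
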

Let Figure 3 represent the trait configuration, and let the points repre- 
sent the termini of the trait vectors on the surface of the sphere. Simple 
structure is shown by the fact that each point lies in one of the three arcs of 
great circles. All of the traits on the arc 1-2 can be described by two pri- 
mary factors, since all of the corresponding trait vectors are coplanar. The 
whole set of traits can be described by three factors. Hence the same pri- 
mary factor is absent in all of the traits along 1-2. The subspace 1-2 can 
be described by the direction cosines of the normal to the plane 1-2. Let 
this normal be denoted $\Lambda_3$. The subscript of $\Lambda_3$ refers to the primary trait 
$T_3$ that is absent in the subspace $L_3$. The vector $\Lambda_3$ is the normal which 
defines the subspace $L_3$. 




FIGURE 3 

By analogy, the vector $\Lambda_1$ is the normal to the plane 2-3 and $\Lambda_2$ is the 
normal to the plane 1-3. If all trait vectors in the plane 1-2 represent two 
primary factors and if all trait vectors in the plane 1-3 represent two pri- 
mary factors, it is clear that the vector which is determined by the inter- 
section of these two planes represents the primary factor which the two 
planes have in common, namely, the primary factor No. 1. In the same 
manner the other primary factors, 2 and 3, are determined by intersections 
of planes. 

The direction cosines of the reference vectors $\Lambda_p$ constitute the columns 
of the matrix of transformation G by which simple structure is demonstrated 
in the traits of Figure 3. 

The reasoning about Figure 3 may be generalized as follows. The matrix 
of the transformation G is shown in (3) in which the r entries of column p 
show the direction cosines of the hyperplane $L_p$. The primary trait vector 
$T_p$ is defined by the intersection of $(r-1)$ hyperplanes, omitting $L_p$. Each 
column of (3) shows the r coefficients of the homogeneous linear equation 
which defines a hyperplane. If $(r-1)$ of these equations are solved simul- 
taneously, the ratios of the unknowns are the ratios of the direction cosines 
of the intersection. Normalizing these ratios, we have the direction cosines 
of the primary trait vector $T_p$: 

(3)
$$
\begin{Vmatrix}
\lambda_{11} & \lambda_{12} & \lambda_{13} & \cdots & \lambda_{1r} \\
\lambda_{21} & \lambda_{22} & \lambda_{23} & \cdots & \lambda_{2r} \\
\vdots & \vdots & \vdots & & \vdots \\
\lambda_{r1} & \lambda_{r2} & \lambda_{r3} & \cdots & \lambda_{rr}
\end{Vmatrix} = G .
$$

The relation between the oblique reference vectors and the primary trait 
vectors can be generalized as follows. Let $\Delta_{mp}$ be the first minor of $\lambda_{mp}$ in (3). 
Then it can be shown that the direction cosines of the primary trait vector 
$T_p$ are proportional to the entries in column p in the matrix H in (4). Let 
the matrix T be produced by normalizing the columns of H. The columns 
of T are the direction cosines of the primary trait vectors $T_p$: 

(4)    $$\Vert (-1)^{m+p} \Delta_{mp} \Vert = H .$$



The cosine of the angular separation of each pair of primary trait vectors 
is the correlation between the corresponding primary traits in the experi- 
mental population. It will probably be found that these correlations are 
positive. If the elements $v_{jp}$ are taken positive or zero, then the angular 
separations of pairs of hyperplanes $L_p$ exceed $\pi/2$. Hence it is to be expect- 
ed that the scalar products, or correlations, of pairs of oblique vectors $\Lambda_p$ 
will be negative. This relation can be illustrated in the plane as follows. 

Let I and II in Figure 4 represent the orthogonal centroid axes of F. Let 
the small circles along the line 1 represent traits which all contain the same 
primary trait. In a similar way let the small circles along the line 2 repre- 
sent traits that contain a second primary trait. Let the two primary traits 
be positively correlated in the experimental population. This is shown by 
the fact that the angular separation between 1 and 2 is less than a right 
angle. 



FIGURE 4 

The radial line 1 is a subspace of one dimension. It is defined by its 
normal $\Lambda_2$. The subscript of $\Lambda_2$ shows the trait which is absent in the sub- 
space 1. The correlations between $\Lambda_2$ and the traits in 1 are all zero. The 
correlations between $\Lambda_2$ and the traits in 2 are all positive. In a similar man- 
ner $\Lambda_1$ is the normal to the subspace 2. All correlations between $\Lambda_1$ and the 
traits in 2 are zero. The correlations between $\Lambda_1$ and the traits in 1 are posi- 
tive. While the angular separation between 1 and 2 is less than $\pi/2$, it is 
seen that the angular separation between $\Lambda_1$ and $\Lambda_2$ exceeds $\pi/2$. Hence its 
cosine is negative. 
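
The relation can be verified numerically in the plane. In the Python sketch below the two radial lines are placed, by assumption, at 20 and 80 degrees from axis I. Each reference vector is the normal to one line, turned toward the other line so that its projections on the traits of that other line are positive. The names are hypothetical.

```python
import math

def unit(angle_deg):
    """Unit vector in the plane at the given angle from axis I."""
    a = math.radians(angle_deg)
    return (math.cos(a), math.sin(a))

def dot(u, v):
    """Scalar product of two vectors."""
    return sum(x * y for x, y in zip(u, v))

# Assumed directions of the two radial lines, 60 degrees apart.
t1, t2 = unit(20.0), unit(80.0)
# Normal to line 1, turned toward line 2, and normal to line 2, turned toward line 1.
lam2 = unit(20.0 + 90.0)
lam1 = unit(80.0 - 90.0)

print(round(dot(t1, t2), 3))      # cosine between the primary vectors: 0.5
print(round(dot(lam1, lam2), 3))  # cosine between the reference vectors: -0.5
```

With primary vectors separated by an angle less than a right angle, the normals are separated by the supplement of that angle, so the two cosines are equal in magnitude and opposite in sign.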

The convincingness of the primary traits that are isolated by the three 
criteria for overdetermined simple structure necessarily depends on the in- 
ventiveness of the scientist in formulating a plausible concept or descriptive 
category for each column of V. When that has been done, the further veri- 
fication of these concepts demands that additional experiments be made 
with traits or variables in which the categories are represented in extreme 
form. For example, if facility in auditory imagery were postulated as a pri- 
mary factor in dealing with certain verbal tests, the verification of this fac- 
tor would require additional experimentation in which the same verbal tests 
are used in combination with non-verbal tests of auditory facility. If the 
latter tests retain even more conspicuous values in the auditory column 
than the auditory verbal tests, then the auditory factor is further experi- 
mentally affirmed. 

The equation of an oblique simple structure 

If a set of r hyperplanes exists such that each of the trait vectors is in at 
least one of the hyperplanes, then the configuration is called a simple struc- 
ture. The general case is that in which the hyperplanes are oblique. The 
special case in which the r hyperplanes are orthogonal is called an orthog- 
onal simple structure. A simple structure is a set of r oblique hyperplanes, 
all of which contain the origin. This set of r hyperplanes may be regarded 
as a degenerate cone whose apex is at the origin and whose surface consists 
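of the r hyperplanes. The equation of the hyperplane $L_p$ is 

(5)    $$\sum_{m=1}^{r} a_m \lambda_{mp} = 0 .$$

The equation of a simple structure in r dimensions may be written by 
setting the product of r polynomials, like (5), for p = 1, 2, 3, . . . , r, equal 
to zero. Then we have 

(6)    $$\Bigl(\sum_{m=1}^{r} a_m \lambda_{m1}\Bigr)\Bigl(\sum_{m=1}^{r} a_m \lambda_{m2}\Bigr) \cdots \Bigl(\sum_{m=1}^{r} a_m \lambda_{mr}\Bigr) = 0 .$$

This equation may be written in the more condensed form 

(7)    $$\prod_{p=1}^{r} \sum_{m=1}^{r} a_m \lambda_{mp} = 0 .$$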



Fitting (7) to a given trait configuration in which a simple structure is 
assumed, we have, for each test j, 

(8)    $$\prod_{p=1}^{r} \sum_{m=1}^{r} a_{jm} \lambda_{mp} = 0 ,$$

or 

(9)    $$\prod_{p=1}^{r} v_{jp} = 0 .$$

If the point j is in at least one of the r hyperplanes of (7), at least one of the 
r factors $v_{jp}$ vanishes, and hence equation (9) is satisfied. Equation (7), or 
its equivalent (9), is therefore satisfied by all points in the r co-ordinate 
hyperplanes of a simple structure. 

In order to determine the best-fitting degenerate cone for a given set of n 
points in a space of r dimensions, equation (7) may be written in the form 
of an observation equation, namely, 

(10)    $$\prod_{p=1}^{r} v_{jp} = \rho_j ,$$

where $\rho_j$ is the discrepancy for the point j. The best-fitting simple structure 
may be defined as that in which 

$$\sum_{j=1}^{n} \rho_j^2$$

is minimized. Hence the criterion for a best-fitting simple structure is the 
minimizing of 

$$\sum_{j=1}^{n} \prod_{p=1}^{r} v_{jp}^2 = \phi .$$

The function $\phi$ will be referred to as the criterion for the isolation of a simple 
structure. 
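
As an illustration, the criterion can be evaluated for any given matrix V. The Python sketch below only computes the sum, over traits, of the product of the squared entries in each row. It does not search for the transformation G that minimizes it. The function name and the small matrices are hypothetical.

```python
def criterion_phi(V):
    """Sum over the rows of V of the product of the squared entries in the row."""
    total = 0.0
    for row in V:
        product = 1.0
        for v in row:
            product *= v * v
        total += product
    return total

exact = [[0.5, 0.0, 0.3], [0.0, 0.2, 0.7]]    # one vanishing entry in every row
print(criterion_phi(exact))                   # 0.0
print(criterion_phi([[0.5, 0.5]]))            # 0.0625
```

The criterion vanishes when every row has a zero entry and grows as the trait vectors move away from the co-ordinate hyperplanes.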

Five methods of isolating primary traits 

When a factorial matrix F has been obtained from the correlational 
matrix R by any method, the second principal problem is to find the trans- 
formation G by which overdetermined simple structure may be discovered 
in the n traits or variables. Five methods will be described, namely: 

1) Graphical method when r < 4, 

2) Method of oblique axes, 

3) Method of averages, 

4) Method of maximizing zero entries in each column of V, 

5) Analytical method. 

When the rank of F is less than 4, the problem is quite simple, because the 
solution may be written by graphical methods. These will be described in 
later sections of this chapter. 

When a psychological hypothesis in the form of a postulated primary 
trait is to be tested, the second and third methods are applicable. The 
method of oblique axes is described in chapter vii. It is a variant of the 
method of principal axes which avoids the pitfalls of the principal axes. The 
method of averages is also described in chapter vii. It is an approximation 
to the method of oblique axes, and it does not require the determination of 
the roots of a characteristic equation. Either of these methods may be used 
for testing directly whether a postulated trait is primary. The fourth meth- 
od isolates one hyperplane at a time in which the number of nearly vanish- 
ing entries is maximized. 

The fifth method is entirely analytical in that it extracts the primary 
traits, if they exist, without presupposing any hypothesis regarding their 
nature. By means of the analytical method the primary traits may be found 
even if their nature is entirely unknown. The analytical method is described 
in chapter vii. 

Graphical methods for less than four dimensions 

If the common-factor space has one dimension, the factor problem is 
solved directly by the methods of chapter v. If the rank is 2, the co-ordi- 
nates $a_{jm}$ of F may be plotted on cross-section paper so that the configura- 
tion of F becomes visible in a plane. Simple structure is then revealed if all 
of the trait vectors are found to lie along two radial lines. The direction 
cosines of the normals to these lines constitute the columns of the trans- 
formation G, which can then be written by inspection. These direction 
cosines can also be thought of as defining two linear subspaces in the plane. 

If the rank of F is 3, the graphical procedure implies a solid model. Two 
methods of handling this case will be described: 

1) The trait vectors may be represented by old-fashioned hatpins that 
are stuck into a central spherical cork. The length of each hatpin j should be 
equal to $h_j$. The angular separation $\phi_{jk}$ for each pair of hatpins, j and k, 
should be such that $r_{jk} = h_j h_k \cos \phi_{jk}$. Simple structure is demonstrated if 
each hatpin lies in one of three planes all of which contain the origin. The 
direction cosines of the normals to these planes constitute the columns of 
the transformation G. 

2) The method of sticking hatpins in a cork has not been tried. A sim- 
pler method is to use the augmented co-ordinates $A_{jm}$ of the trait vectors. 
These are also the direction cosines of unit trait vectors. They can be repre- 
sented as points on the surface of a ball. 
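
The angular separation required for the solid model can be computed from the rows of F. The Python sketch below assumes that the correlation of two tests is the one reproduced by the scalar product of their rows. The rows shown are those of tests 10 and 2 in Table 2, and the function name is hypothetical.

```python
import math

def separation_degrees(row_j, row_k):
    """Angle between two trait vectors, from r_jk = h_j * h_k * cos(phi_jk)."""
    r_jk = sum(a * b for a, b in zip(row_j, row_k))    # reproduced correlation
    h_j = math.sqrt(sum(a * a for a in row_j))         # length of trait vector j
    h_k = math.sqrt(sum(b * b for b in row_k))         # length of trait vector k
    cosine = max(-1.0, min(1.0, r_jk / (h_j * h_k)))   # guard against rounding
    return math.degrees(math.acos(cosine))

angle = separation_degrees([0.642, 0.443, -0.150], [0.579, 0.499, -0.090])
print(round(angle, 1))
```

The two tests are separated by only a few degrees, which is consistent with their neighboring positions in the configuration.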

In plotting the trait configuration on a sphere, it will be found conven- 
ient to use the following method: Locate three orthogonal points on the 
surface and mark them I, II, III, to represent the three reference vectors for 
the direction cosines of the traits. Mark off on a narrow strip of paper the 
distance $\pi D/4$, where D is the diameter of the sphere. This is also the sur- 
face distance between any two of the orthogonal reference points. Divide 
this distance into ninety parts in any convenient units to represent 90°. On 
the same strip of paper mark off the cosines of angles with any convenient 
unit such as .00, .05, .10, etc. In doing this, look up $\cos^{-1} .05$, $\cos^{-1} .10$, and 
mark .05, .10 on the strip at the appropriate angles. The strip is then ready 
for use. In locating a point on the sphere, use the arithmetical check which 
is provided by the fact that the position of each point is determined by the 
angular separations from two reference points. The angular separation from 
the third reference point constitutes an arithmetical check. 
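
The graduation of the strip can be prepared by computation instead of by tables. The Python sketch below assumes a sphere of diameter 20 in arbitrary units and measures each mark from the end of the strip that represents an angle of zero. The function name is hypothetical.

```python
import math

def strip_marks(diameter, cosines):
    """Distance along the strip at which each cosine value is to be marked."""
    quarter = math.pi * diameter / 4.0              # strip length, representing 90 degrees
    marks = {}
    for c in cosines:
        angle = math.degrees(math.acos(c))          # angle whose cosine is c
        marks[c] = quarter * angle / 90.0           # ninety equal parts per quarter circle
    return marks

marks = strip_marks(20.0, [0.0, 0.05, 0.10, 0.5, 1.0])
for c in sorted(marks):
    print(c, round(marks[c], 2))
```

A cosine of .00 falls at the full length of the strip and a cosine of 1.00 at its zero end, so that small cosines are crowded toward the far end.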

Simple structure is demonstrated if each point lies in one of the sides of a 
spherical triangle. Each of the three sides determines a plane through the 
origin. The direction cosines of the normal to each plane constitute a col- 
umn in the transformation G. The vertices of the triangle define the pri- 
mary traits. The correlations between the primary traits in the experi- 
mental population are the cosines of the intervening sides of the spherical 
triangle. If the figure is a right spherical triangle, the primary traits are un- 
correlated in the experimental population. If the figure is an oblique spheri- 
cal triangle, the primary traits are correlated in the experimental popula- 
tion. 

The problem of negative abilities 

All of the methods of factoring a correlational matrix that have been de- 
scribed give a factorial matrix with negative cell entries in the second and 
subsequent columns. The numerical values in each row of the factorial 
matrix F describe one of the traits in terms of arbitrary orthogonal refer- 
ence axes. Since the axes are arbitrary, the psychological interpretation of 
the matrix F, as obtained by the centroid method or by any other equiva- 
lent method, is certain to lead to erroneous results unless the matrix is 
rotated so as to satisfy some additional criterion of the relation between the 
trait configuration and the reference axes. 

If the variables in the correlational matrix represent personality traits 
other than abilities, then either positive or negative values of $v_{jm}$ are ad- 
missible to psychological interpretation. For example, "tactfulness" and 
"tactlessness" are two traits which can be so defined that their co-ordinates 
are identical except for reversal of sign. Two disease symptoms might be in 
a similar inverse relation. The likes and dislikes of people might be related 
in the same manner. The trait "stability" probably has a negative projec- 
tion on the reference vector of "emotionality." 

If the traits in the correlational matrix represent abilities, it is not likely 
that the values of $v_{jm}$ in V will be negative. By one interpretation, a nega- 
tive value of $v_{jm}$ would mean that performance in a psychological test j is 
actually facilitated by the lack of some sort of ability m. Ideal constructs 
can be devised so as to allow some plausible interpretation for what might 
be called "negative abilities," but this does not seem to be necessary. This 
reasoning leads to the 

Hypothesis: When unique simple structure is found for a battery of psy- 
chological tests, then the non-vanishing entries in the factorial matrix 
are positive. 

It will be convenient to name the bounded space within which any 
radial vector has only positive direction cosines. This space will be de- 
fined as follows: 

Definition: The bounded space in which any radial vector has only positive 
direction cosines will be called the positive region. 

Definition: If all the trait vectors that do not lie in a hyperplane are on the 
same side of it, the hyperplane will be called a positive hyperplane 
with reference to the trait configuration. 

Definition: If a set of r positive hyperplanes exists such that each trait 
vector is contained in one or more of them, then the combined configura- 
tion of the trait vectors and the reference vectors will be called a positive 
simple structure. 

The geometrical interpretation of the restriction upon the numerical val- 
ue of $a_{jm}$ in F in the case of mental ability tests is that all of the test vectors 
lie in the positive region of the common-factor space. When this condition 
is satisfied, all of the intertest correlations are positive or zero. It is a uni- 
versally accepted fact that intertest correlations are positive. 

The converse is not necessarily valid. The well-known fact that all inter- 
test correlations are positive implies that all of the test vectors lie inside a 
cone with center at the origin and with a generating angle of $\pi/4$. Such a 
cone cannot be inscribed in the positive region except when the number of 
dimensions is as low as two. 

The restriction that all of the test vectors shall be in the positive region 
of the common-factor space is not sufficient to determine F uniquely. In 
general, there exists an infinite number of orthogonal transformations by 
which all of the entries in F become positive or zero if the configuration of F 
can be inscribed in the positive region. Special cases may be set up in which 
one, and only one, orthogonal transformation will make the entries $a_{jm}$ in F 
positive or zero. Such a case in three dimensions is that in which three test 
vectors are mutually orthogonal. These cases are not likely to be found in 
practice. Hence a unique matrix F is not to be expected with the single cri- 
terion that $a_{jm} \geq 0$ in F. 

Graphical analysis of fifteen psychological tests 

The graphical methods will be illustrated on the fifteen psychological 
tests of Brigham which were used for numerical examples in the third and 
fourth chapters. The fourth column in Table (25-iii) contains entries whose 
maximum contribution to any correlation coefficient is about .067. This is 
not large enough to justify serious consideration, and hence the test vectors 
can be represented in three dimensions with fair approximation. In Table 2 
are recorded the first three centroid factors and the communalities for the 
first three factors. The last three columns show the corresponding aug- 
mented co-ordinates. 

Table 2 

                                 CENTROID CO-ORDINATES               DIRECTION COSINES
      TESTS                        I       II      III     $h^2$       I       II      III

10.  Opposites                   .642    .443   -.150    .6309     .808    .558   -.189
 2.  Opposites                   .579    .499   -.090    .5923     .752    .648   -.117
 5.  Opposites                   .561    .449   -.041    .5180     .779    .624   -.057
 3.  Analogies                   .712    .228    .092    .5674     .945    .303    .122
 4.  Artificial language         .633    .134    .061    .4224     .974    .206    .094
 1.  Definitions                 .685    .159    .157    .5192     .951    .221    .218
 8.  Geometrical completion      .529   -.144    .207    .3434     .903   -.246    .353
 7.  Arithmetical problems       .559   -.146    .233    .3881     .897   -.234    .374
 9.  Arithmetical proportions    .546   -.222    .162    .3736     .893   -.363    .265
 6.  Number series               .585   -.293    .274    .5032     .825   -.413    .386
15.  Card-turning                .475   -.112   -.132    .2556     .939   -.222   -.261
14.  Block construction          .428   -.235   -.149    .2606     .838   -.460   -.292
17.  Dice-counting               .619   -.303   -.194    .5126     .865   -.423   -.271
11.  Painted cubes               .598   -.313   -.272    .5296     .822   -.430   -.374
18.  Form-learning               .436   -.084   -.099    .2070     .958   -.185   -.218



The fifteen test vectors can be represented as points on a sphere. When 
this is done, the configuration of the test vectors can be inspected independ- 
ently of the arbitrary centroid reference planes. In Figure 5 the configura- 
tion is shown by plotting the augmented factor loadings III against II. The 
first factor is then perpendicular to the plane of the diagram. 

Inspection of the sphere shows that three reference planes may readily 
be located so that each of the test vectors lies in at least one of the three 
planes or very near to one of them. The plane AB is determined by the 
centroids of the tests 2, 5, and 8, 7. The plane AC is determined by the cen- 
troids of the tests 6, 9, and 11, 14. The plane BC is determined by the cen- 
troids of the tests 10, 2, and 11, 14. 

The direction cosines of the three planes are shown in Table 3. This table
is the matrix $G$ of the transformation of $F$ into $V$. Each column shows the
direction cosines of an oblique reference vector $\Lambda_p$; and these are
also the direction cosines of one of the three subspaces of dimensionality
$(r-1)$. In the present example these subspaces are the planes AB, AC, and BC.




FIGURE 5 


Table 3

         $\Lambda_A$   $\Lambda_B$   $\Lambda_C$
I           .304          .441          .244
II         -.154          .893         -.415
III         .940         -.088         -.876



Table 4 shows the matrix $V$ for this particular problem. It can be seen
that those test vectors which lie close to one of the three reference planes
$L_p$ in Figure 5 are also those which are represented with nearly vanishing
entries in the corresponding column $p$ of Table 4.
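
The relation between the centroid loadings, the direction cosines of Table 3, and the entries of Table 4 can be checked numerically. The following is a minimal sketch, assuming that each entry of $V$ is the scalar product of a row of $F$ with a column of $G$; the function name `project` and the choice of three sample tests are illustrative only.

```python
# Direction cosines of the reference vectors (Table 3). Rows are the centroid
# axes I, II, III; columns are Lambda_A, Lambda_B, Lambda_C.
G = [
    [0.304, 0.441, 0.244],
    [-0.154, 0.893, -0.415],
    [0.940, -0.088, -0.876],
]

# Centroid loadings (I, II, III) of three of the fifteen tests.
F = {
    10: [0.642, 0.443, -0.150],
    2: [0.579, 0.499, -0.090],
    6: [0.585, -0.293, 0.274],
}

def project(loadings, g):
    """Projections of one test vector on Lambda_A, Lambda_B, Lambda_C."""
    return [sum(loadings[m] * g[m][p] for m in range(3)) for p in range(3)]

V = {test: project(row, G) for test, row in F.items()}

for test, row in V.items():
    # Table 4 lists the same values in the order Lambda_C, Lambda_B, Lambda_A.
    print(test, [round(x, 3) for x in reversed(row)])
```

Within rounding, the printed rows agree with the rows of Table 4 for tests 10, 2, and 6.
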
The three primary abilities are defined by the intersections of reference
planes. These are shown at the points A, B, and C. The direction cosines
of the primary ability vectors $T_p$ may be obtained either by the
intersection of pairs of planes $L_p$ or, more formally, by the matrix (4).
The matrix $T$ for the present example is shown in Table 5.

The matrix of Table 4 shows simple structure, but it is not a unique con- 
figuration with the reference planes. The reason is that there are no test 

Table 4

Test   $r_{j\Lambda_C}$   $r_{j\Lambda_B}$   $r_{j\Lambda_A}$
              I                 II                III
 10         .104              .692              -.014
  2         .013              .709               .015
  5        -.014              .652               .063
  3        -.001              .510               .268
  4         .045              .393               .229
  1        -.036              .430               .331
  8         .008              .086               .378
  7        -.007              .096               .411
  9         .083              .028               .352
  6         .024             -.028               .481
 15         .278              .121               .038
 14         .332             -.008               .026
 17         .447              .019               .052
 11         .514              .008              -.026
 18         .228              .126               .052

Table 5

          $T_A$     $T_B$     $T_C$
I          .834      .722      .829
II        -.372      .681     -.443
III        .408     -.122     -.341



vectors along the arc AC or along the arc BC. The five test vectors near C
may be regarded as identical except for experimental errors. It is for this 
reason that the primary traits A, B, and C cannot be inferred with certain- 
ty. Simple structure would be obtained as well by drawing a reference plane 
through the centroid of (1, 3, 4) and (15, 18) instead of the plane BC. The 
simple structure would involve this plane and the planes AC and AB. The 
psychological interpretation would be difficult because the tests at B would 
have negative factor loadings for the trait A. Hence the structure shown in 
Figure 5 is the more probable one, though it cannot be demonstrated with 
certainty. The simple structure of Figure 5 can be shown to be unique 
only by a more extensive test battery which includes tests along AC and
along BC. 

With the reservations just written, it is of some interest to note the
primary abilities A, B, and C, and to consider tentatively their psychological
nature. Since B consists of opposites tests, one might postulate a verbal 
factor. The tests at A are numerical, so that a number factor might also be 
postulated. Inspection of tests 1, 3, 4, shows them to lie in the plane AB. 
This seems reasonable since they are verbal in character; but they also con- 
tain some of the precision and restrictiveness of numerical work. This rela- 
tion raises the psychological question whether the number factor is essen- 
tially concerned with number as such or with some kind of facility for logi- 
cal or other restrictive thinking of which numerical work is only a good ex- 
ample. This is a question of fact which can be established by experimental 
inquiry with larger test batteries. The factor C is evidently concerned with 
visual imagery and perhaps with kinesthesis. The battery does not contain 
enough tests to establish their separation if they are separate abilities. 



CHAPTER VII 

ISOLATION OF PRIMARY FACTORS 
Method of oblique axes 

This method was devised for testing the hypothesis that a specified trait 
is primary. For example, if the hypothesis is entertained that a space factor 
is primary in the fifteen tests of Brigham, the method of oblique axes makes 
it possible to test this hypothesis directly. The method is general and in no 
sense limited to psychological tests which are used for illustrative purposes. 

Let the trait which is postulated as primary be denoted $T_p$. If the
hypothesis is clearly formulated, it should be possible to describe $T_p$ in
terms of other traits in which it is involved and in terms of still other
traits in which the supposed primary component $T_p$ is absent. If a battery
of $n$ traits has been found to involve $r$ factors and if the postulated
trait $T_p$ is primary in this battery, then there should be only $(r-1)$
factors in the residual battery which is obtained by merely eliminating those
traits in which $T_p$ can be involved. This idea can be illustrated with a
postulated space factor as an example. If the fifteen tests of the battery
contain several tests that involve space thinking and if the whole battery is
well described by three factors, then the residual battery which is obtained
by eliminating the space tests should be describable in terms of only two
factors.

Each trait in the residual battery is described in terms of r factors in the 
factorial matrix F. It does not matter for the present problem that the ref- 
erence axes of F are arbitrary. 

After eliminating the traits that may conceivably involve the postulated
primary trait $T_p$, let there remain $n_0$ traits in which $T_p$ is almost
certainly absent. When these rows of $F$ have been eliminated, there remains a
factorial matrix $F_0$ of $n_0$ rows and $r$ columns. The rank of the reduced
factorial matrix $F_0$ must be $(r-1)$ if $T_p$ is primary in $F$.

The trait configuration of the $n_0$ traits involves, therefore, $(r-1)$
dimensions; but each one of them is described in terms of $r$ co-ordinates in
$F_0$. If the $r$ principal axes are determined for the matrix $F_0$ by the
methods of chapter iv, one of the roots $\beta$ must vanish, because the
extension of the trait configuration is vanishing in the $r$th dimension. This
situation was anticipated in the first numerical example of chapter iv, where
the system was intentionally devised so that one of the roots $\beta$ did
vanish. If $n_1$ roots $\beta$ vanish in the characteristic equation, then the
trait configuration is of dimensionality $(r-n_1)$, and this is also the
number of primary traits in the battery.

The verification of $T_p$ as a primary trait is accomplished by the following
procedures:

1) Eliminate from the factorial matrix $F$ those traits which may
conceivably involve the postulated primary trait $T_p$. This gives the reduced
factorial matrix $F_0$ of $n_0$ rows and $r$ columns.

2) Make sure that $F_0$ satisfies the inequality (5-ii), so that the trait
configuration is unique.

3) Determine the $r$ roots $\beta$ of the characteristic equation for $F_0$.

4) If only one root $\beta$ vanishes, then one, and only one, primary factor
was removed from $F$ by its reduction to $F_0$. The trait $T_p$ is therefore
primary.

5) If $n_1$ roots $\beta$ vanish, then $n_1$ primary factors were removed from
$F$ in its reduction to $F_0$, and the hypothesis is obscured. In supposedly
removing one primary factor, several primary factors were removed. The traits
must be re-examined in order to ascertain whether additional, but yet
unformulated, primary factors were inadvertently removed from $F$ together
with $T_p$, or whether the trait $T_p$ is itself a trait of multiple
complexity $n_1$.

6) If none of the roots $\beta$ vanish, then the hypothesis is disproved,
because the residual factorial matrix $F_0$ is of the same rank as the
original matrix $F$. No primary factor has been removed from $F$. The problem
then calls for another guess about the nature of the primary factors.

If this process is repeated for r successive postulated traits and if these 
are verified in the same manner, then the result will be a set of r primary 
trait vectors in terms of which the oblique factorial matrix V may be 
written. The present method is called a "method of oblique axes" because 
the co-ordinate axes of V are not necessarily orthogonal. The angular sepa- 
rations of the primary trait vectors of V are functions of the intercorrela- 
tions of the primary traits in the experimental population. These intercor- 
relations are affected by the fortuitous conditions that vary more or less 
uncontrollably from one sample population to another. It is one of the fun- 
damental problems of factorial analysis to transcend these fortuitous condi- 
tions that characterize random samples. As long as the discovery of the 
fundamental categories of a science is markedly affected by the fortuitous 
elements of random sampling, the categories are not likely to be significant. 

Numerical example of method of oblique axes 

The method will be illustrated on the battery of fifteen tests that were 
used in chapter iii as a numerical example. Table (25-iii) shows the factor
loadings for the fifteen tests. Since the first three factors account for the 
intercorrelations within small residuals, only the first three columns of the 
factorial matrix will be used. This will facilitate a comparison of the results 
of the present example with the graphical methods previously used for the
same problem. Inspection of the tests suggests that a visual or space factor 
is present in some of them. The tests which are most conspicuously spatial 
in character are 11, 14, 15, 17, 18. After eliminating these five tests, there 

Table 1

Tests                                I       II      III
10   Opposites                     .642    .443   -.150
 2   Opposites                     .579    .499   -.090
 5   Opposites                     .561    .449   -.041
 3   Analogies                     .712    .228    .092
 4   Artificial language           .633    .134    .061
 1   Definitions                   .685    .159    .157
 8   Geometrical completion        .529   -.144    .207
 7   Arithmetic problems           .559   -.146    .233
 9   Arithmetical proportions      .546   -.222    .162
 6   Number series                 .585   -.293    .274



remain ten tests for the residual factorial matrix $F_0$. This is reproduced
for the first three centroid factors in Table 1. Although each of these ten
tests is here described by three co-ordinates, the rank of the matrix should
be only 2. Hence one of the roots $\beta$ should vanish.

Table 2

              1                      2                      3
1    3.671647 + $\beta$         .730882                .528743
2     .730882              .919257 + $\beta$          -.255728
3     .528743                  -.255728           .267573 + $\beta$



In Table 2 the cross products are summarized. This table shows the
coefficients of the simultaneous equations (13-iv). The three roots $\beta$ of
the characteristic equation are as follows:

$\beta_1 = -3.910283$ ,  $\beta_2 = -.930210$ ,  $\beta_3 = -.017984$ .

It is seen that one of these roots is almost zero. Since this root is the sum 
of the squares of the projections of the test vectors on the minor principal 
axis, it is seen that the mean squared projection for the ten tests is .0018. 
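
The cross products and the three roots can be recomputed from the loadings of Table 1. The sketch below is not the computing routine of chapter iv: it assumes the roots are the negatives of the eigenvalues of the cross-product matrix, and it obtains those eigenvalues with the standard trigonometric formula for a symmetric matrix of order 3. All function names are illustrative.

```python
import math

# Centroid loadings (I, II, III) of the ten residual tests, from Table 1.
F0 = [
    (0.642, 0.443, -0.150), (0.579, 0.499, -0.090), (0.561, 0.449, -0.041),
    (0.712, 0.228, 0.092), (0.633, 0.134, 0.061), (0.685, 0.159, 0.157),
    (0.529, -0.144, 0.207), (0.559, -0.146, 0.233), (0.546, -0.222, 0.162),
    (0.585, -0.293, 0.274),
]

def cross_products(rows):
    """The 3 x 3 matrix of sums of squares and cross products of the columns."""
    return [[sum(r[a] * r[b] for r in rows) for b in range(3)] for a in range(3)]

def det3(m):
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def symmetric_eigenvalues(m):
    """Eigenvalues of a symmetric 3 x 3 matrix, largest first."""
    q = (m[0][0] + m[1][1] + m[2][2]) / 3.0
    off = m[0][1] ** 2 + m[0][2] ** 2 + m[1][2] ** 2
    p2 = sum((m[i][i] - q) ** 2 for i in range(3)) + 2.0 * off
    if p2 == 0.0:
        return [q, q, q]  # the matrix is a multiple of the identity
    p = math.sqrt(p2 / 6.0)
    b = [[(m[i][j] - (q if i == j else 0.0)) / p for j in range(3)]
         for i in range(3)]
    half_det = max(-1.0, min(1.0, det3(b) / 2.0))
    phi = math.acos(half_det) / 3.0
    largest = q + 2.0 * p * math.cos(phi)
    smallest = q + 2.0 * p * math.cos(phi + 2.0 * math.pi / 3.0)
    return [largest, 3.0 * q - largest - smallest, smallest]

C = cross_products(F0)
eigenvalues = symmetric_eigenvalues(C)
betas = [-e for e in eigenvalues]  # the roots of the text carry the opposite sign
mean_squared_projection = eigenvalues[2] / len(F0)

print([round(b, 4) for b in betas])
print(round(mean_squared_projection, 4))
```

The third root is small in comparison with the other two, which is the numerical form of the statement that the ten residual tests lie nearly in a plane.
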

The direction cosines of the vector which is determined by substituting
$\beta_3$ in (13-iv) are as follows:

$\lambda_{13} = +.211980$ ,  $\lambda_{23} = -.422010$ ,  $\lambda_{33} = -.881460$ .
In Table 3 are listed the projections of the fifteen test vectors on the
vector $\Lambda_3$; and it is of special interest to note that the ten tests
of the residual battery have nearly vanishing projections on $\Lambda_3$,
whereas the five tests which were postulated to contain a space factor have
marked positive projections on $\Lambda_3$. This result proves that a primary
factor was removed from the test battery when the five space tests were
eliminated.

Table 3

Tests                              $r_{j\Lambda_3}$
10   Opposites                        .081
 2   Opposites                       -.009
 5   Opposites                       -.034
 3   Analogies                       -.026
 4   Artificial language              .024
 1   Definitions                     -.060
 8   Geometrical completion          -.009
 7   Arithmetic problems             -.025
 9   Arithmetical proportions         .067
 6   Number series                    .006
15   Card turning                     .264
14   Block construction               .321
17   Dice counting                    .430
11   Painted cubes                    .498
18   Form learning                    .215



Constellations 

In formulating hypotheses concerning the nature of the primary traits, 
it is sometimes a considerable aid to know of constellations that may exist 
in the trait configuration. By a "constellation" is meant a grouping of trait 
vectors. It happens not infrequently that the trait configuration consists 
essentially in groups of trait vectors. The angular separations between the 
trait vectors within a constellation are relatively small, while the separa- 
tions between constellations are marked. 

When the dimensionality of the factorial matrix is less than four, the 
constellations may be inspected readily by graphical methods. When the 
dimensionality exceeds three, the graphical methods are not available, and 
it is then useful to have a routine by which the constellations may be iso- 
lated in the trait configuration. Since the constellations are to be used as an 
aid to intuition regarding the nature of the primary traits, it is not advisable 
to define a constellation rigorously as regards maximum angular separations 
or as regards the maximum generating angle of the cone which shall include 
a constellation. Such restrictions may be arbitrarily imposed by the in- 
vestigator for each study in terms of the dimensionality of F and the mean 
order of magnitude of the communalities that are involved. 
If an attempt is made to isolate constellations from a large battery of
traits, say fifty or more, without some systematic procedure, it is usually 
found that the groupings become entangled in annoying complexity. If the 
constellations do not exist, the procedure must make this fact evident; but, 
on the other hand, constellations can be drawn for the purposes of studying 
the battery even though the traits arrange themselves more in the nature 
of chains than constellations. In three dimensions this situation is illus- 
trated by a battery of traits whose configuration reveals a spherical triangle 
in which the sides of the triangle are pretty well defined by the trait vectors. 
If all of them lie in sides of a spherical triangle, then the isolation of constel- 
lations would be difficult, because there may be no sharp break between one 
constellation and the next. In three dimensions the graphical methods 
would, of course, be used because of their simplicity and directness; but in 
higher dimensions the groupings may be obtained by inspectional methods 
from the intercorrelations corrected for uniqueness. 

One useful procedure is to ascertain first the average correlation in each
column of the correlational matrix $R_u$ where the given coefficients have
been corrected for uniqueness. (An alternative is to count in each column the
number of coefficients whose absolute values exceed, say, .80 or .90.) Select
the trait $T_x$ with highest mean coefficient. List all the traits whose
correlations with $T_x$ exceed .80 and complete the correlation table for the
traits so selected. Eliminate from the table the trait which has the largest
number of intercorrelations less than .80. Repeat the eliminating process
until all the traits that remain in the table have intercorrelations that
exceed .80. These traits constitute a constellation. Select the trait whose
mean coefficient is next highest and which is not listed in the group just
formed, and proceed with it in the same manner as with $T_x$ until the
majority of the traits have been assigned. These groupings are flexible, and
they may be arranged to overlap. The arrangement of the traits in
constellations should be regarded merely as a device for studying them and
for formulating hypotheses concerning the underlying primary factors.
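
The routine just described can be sketched as a short program. This is only one reading of the procedure: the exclusion of the diagonal from the column means, the rule for breaking ties, the `coverage` parameter that stands for "the majority of the traits," and the five-trait matrix are all assumptions made for illustration.

```python
def column_means(r):
    """Mean correlation in each column, leaving out the diagonal entry."""
    n = len(r)
    return [sum(r[i][j] for i in range(n) if i != j) / (n - 1) for j in range(n)]

def prune(r, members, threshold):
    """Drop the worst trait until every remaining pair exceeds the threshold."""
    members = list(members)
    while True:
        low = {t: sum(1 for o in members if o != t and r[t][o] <= threshold)
               for t in members}
        worst = max(members, key=lambda t: low[t])
        if low[worst] == 0:
            return members
        members.remove(worst)

def constellations(r, threshold=0.80, coverage=0.5):
    """Form groups until more than `coverage` of the traits are assigned."""
    n = len(r)
    means = column_means(r)
    order = sorted(range(n), key=lambda j: -means[j])
    assigned, groups = set(), []
    for seed in order:
        if len(assigned) > coverage * n:
            break
        if seed in assigned:
            continue
        candidates = [seed] + [j for j in range(n)
                               if j != seed and r[seed][j] > threshold]
        group = sorted(prune(r, candidates, threshold))
        groups.append(group)
        assigned.update(group)
    return groups

# Hypothetical correlations, corrected for uniqueness, for five traits.
R = [
    [1.00, 0.90, 0.90, 0.20, 0.20],
    [0.90, 1.00, 0.90, 0.20, 0.20],
    [0.90, 0.90, 1.00, 0.20, 0.20],
    [0.20, 0.20, 0.20, 1.00, 0.85],
    [0.20, 0.20, 0.20, 0.85, 1.00],
]

print(constellations(R))                # stops once a majority is assigned
print(constellations(R, coverage=1.0))  # continues until seeds are exhausted
```

Because candidates for a later group are drawn from the whole battery, the groups produced in this way may overlap, as the text allows.
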

The method of averages 

This method is a modification of the method of oblique axes. Though not 
theoretically so satisfactory, it is useful, since it gives results which are 
closely similar to those of the method of oblique axes; and it is shorter in 
computational work, in that it does not require the determination of the 
roots of the characteristic equation. 

When the traits which may contain the postulated primary factor have
been eliminated from $F$, there remain $n_0$ traits in the reduced factorial
matrix $F_0$. These are arranged in ascending or descending order of
magnitude, according to the co-ordinates in one of the columns of $F_0$ which
shows a considerable range in numerical values. The second column is usually
one of the best if $F$ has been computed by the centroid method. The $n_0$
traits of $F_0$ are then divided into $(r-1)$ groups. The co-ordinates of the
centroid of each group are then determined. These define $(r-1)$ points in the
common-factor space. It is desired to find the hyperplane $L$ which contains
the $n_0$ traits in $F_0$. In the method of averages the hyperplane $L$ is
taken to be that hyperplane which contains the vectors whose termini are the
centroids of the $(r-1)$ groups of traits. The projection of each of the $n_0$
traits on the normal $\Lambda$ to the hyperplane $L$ is then determined. These
should all vanish. If $\Lambda$ is a principal axis of the system, or if
$\Lambda$ is near one of the principal axes, then the sum of the squares of
the projections of the $n_0$ trait vectors on $\Lambda$ will be nearly equal
to that root of the characteristic equation which is zero or nearly vanishing.
On the other hand, the $(n-n_0)$ eliminated traits should have appreciable
projections on $\Lambda$ in order to establish that a primary factor was
removed in reducing $F$ to $F_0$.

Numerical example of the method of averages 

The same set of fifteen psychological tests will be used as an example.
The ten tests in $F_0$ may be divided according to the signs of the second
column of $F_0$ into $(r-1) = 2$ groups as follows: A = (10, 2, 5, 3, 4, 1)
and B = (8, 7, 9, 6). The centroids of these two groups of points are as
follows:





          I         II        III
A       .6353     .3187     .0048
B       .5548    -.2012     .2190



Let the co-ordinates of the centroid of group A be $A_1$, $A_2$, $A_3$ and let
the corresponding co-ordinates for group B be $B_1$, $B_2$, $B_3$. Then if the
two vectors A and B are to be contained in the plane $L$, it is necessary that
both A and B have vanishing projections on the normal $\Lambda$ to the plane
$L$. Hence

$A_1\lambda_1 + A_2\lambda_2 + A_3\lambda_3 = 0$ ,
$B_1\lambda_1 + B_2\lambda_2 + B_3\lambda_3 = 0$ .

Expressing all of the $\lambda$'s in terms of one of them, and normalizing, we
have the values listed in Table 4. It is interesting to note that these values
are nearly the same as those which were found by the method of oblique axes.
These are shown in the first column of Table 4. In Table 5 are shown the
projections of the fifteen test vectors on $\Lambda$ as defined in the second
column of Table 4. Note that the projections of the ten tests of $F_0$ almost
vanish, while the projections of the five space tests which were removed from
$F$ are appreciable in magnitude. These results could, of course, be predicted
by inspection of the sphere on which the fifteen normalized test vectors were
plotted.
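
The computation of this example can be retraced in a few lines. The sketch assumes that the normal to the plane through the two centroid vectors is taken as their vector product, which satisfies both of the equations above; the loadings are those of Table 1 together with the loadings of the five space tests.

```python
import math

# Centroid loadings (I, II, III) of the fifteen tests, keyed by test number.
F = {
    10: (0.642, 0.443, -0.150), 2: (0.579, 0.499, -0.090),
    5: (0.561, 0.449, -0.041), 3: (0.712, 0.228, 0.092),
    4: (0.633, 0.134, 0.061), 1: (0.685, 0.159, 0.157),
    8: (0.529, -0.144, 0.207), 7: (0.559, -0.146, 0.233),
    9: (0.546, -0.222, 0.162), 6: (0.585, -0.293, 0.274),
    15: (0.475, -0.112, -0.132), 14: (0.428, -0.235, -0.149),
    17: (0.619, -0.303, -0.194), 11: (0.598, -0.313, -0.272),
    18: (0.436, -0.084, -0.099),
}
GROUP_A = [10, 2, 5, 3, 4, 1]   # positive loadings on factor II
GROUP_B = [8, 7, 9, 6]          # negative loadings on factor II
SPACE = [15, 14, 17, 11, 18]    # tests removed from F

def centroid(tests):
    return tuple(sum(F[t][m] for t in tests) / len(tests) for m in range(3))

def cross(a, b):
    """Vector product: perpendicular to both a and b."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def unit(v):
    n = math.sqrt(sum(x * x for x in v))
    return tuple(x / n for x in v)

A = centroid(GROUP_A)
B = centroid(GROUP_B)
normal = unit(cross(A, B))
projections = {t: sum(F[t][m] * normal[m] for m in range(3)) for t in F}

print([round(x, 3) for x in normal])
for t in GROUP_A + GROUP_B + SPACE:
    print(t, round(projections[t], 3))
```

The direction cosines agree with the second column of Table 4, and the projections agree with Table 5 to within rounding.
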

Table 4

                 Method of        Method of
                Oblique Axes      Averages
$\lambda_1$        +.212            +.207
$\lambda_2$        -.422            -.400
$\lambda_3$        -.881            -.893



Table 5

Tests                              $r_{j\Lambda_3}$
10   Opposites                        .090
 2   Opposites                        .001
 5   Opposites                       -.027
 3   Analogies                       -.026
 4   Artificial language              .023
 1   Definitions                     -.062
 8   Geometrical completion          -.018
 7   Arithmetic problems             -.034
 9   Arithmetical proportions         .057
 6   Number series                   -.006
15   Card turning                     .261
14   Block construction               .316
17   Dice counting                    .423
11   Painted cubes                    .492
18   Form learning                    .212



The special case of rank 2 

When the matrix $F$ has been obtained, the communalities are known, so
that the matrix $F_u$ can be written in which the cell entries show the
direction cosines of the augmented or unit trait vectors. The cross products
$F_u F_u' = R_u$, in which $R_u$ contains the intercorrelations corrected for
uniqueness.

Let some of the traits be of complexity 2, so that their intercorrelations
may be described linearly in terms of two factors. Let a new matrix $\Phi$ be
formed whose entries $\phi_{jk}$ are determined by the relation

(1)    $\phi_{jk} = \cos^{-1} R_{jk}$ .

Select any two columns of $\Phi$, say $l$ and $m$. Then, if the other trait
vectors lie in the plane of the two trait vectors $l$ and $m$, we have

(2)    $\phi_{jl} = \phi_{jm} + \phi_{ml}$ .

If the column $l$ of $\Phi$ is plotted against the column $m$, the plot should
be linear with a slope of unity. The $l$-intercept is the angular separation
$\phi_{ml}$ between the trait vectors $m$ and $l$. In this graphical procedure
it becomes immediately evident which of the traits in $R_u$ are coplanar, or
nearly coplanar, with the pair of traits $l$ and $m$. All the traits $j$ which
are represented in the graph on a line through the $l$-intercept of
$\phi_{lm} = \cos^{-1} R_{lm}$ with a slope of unity can be described linearly
in terms of two factors. It is of interest to note that a constant can be
added to each column of the matrix so that all columns become proportional.
The rank is then reduced to 1.
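
A small numerical illustration of relation (2) may be useful. The unit vectors below are hypothetical: three of them lie in the plane of the traits $l$ and $m$, beyond $m$ as seen from $l$, and one is lifted out of that plane. The names `unit_vector` and `separation` are likewise illustrative.

```python
import math

def unit_vector(degrees, lift=0.0):
    """Unit vector at the given angle in the I-II plane, optionally lifted out of it."""
    t = math.radians(degrees)
    x, y, z = math.cos(t), math.sin(t), lift
    n = math.sqrt(x * x + y * y + z * z)
    return (x / n, y / n, z / n)

def separation(a, b):
    """Angular separation phi = arccos(R) of two unit vectors, in degrees."""
    r = sum(p * q for p, q in zip(a, b))
    return math.degrees(math.acos(max(-1.0, min(1.0, r))))

trait_l = unit_vector(0.0)
trait_m = unit_vector(20.0)
coplanar = [unit_vector(d) for d in (35.0, 50.0, 70.0)]
lifted = unit_vector(50.0, lift=0.8)

phi_ml = separation(trait_m, trait_l)
# For coplanar traits, phi_jl - phi_jm equals the constant phi_ml.
differences = [separation(j, trait_l) - separation(j, trait_m) for j in coplanar]
lifted_difference = separation(lifted, trait_l) - separation(lifted, trait_m)

print(round(phi_ml, 3), [round(d, 3) for d in differences])
print(round(lifted_difference, 3))
```

The coplanar traits fall on the line of unit slope with intercept $\phi_{ml}$, while the lifted trait departs from it.
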

Projections of unit trait vectors into a hyperellipsoid 

Let $a_{jk}$ represent each element in a matrix $Q$ of order $n \times s$ of
rank $r$, and let the matrix be normalized by rows so that

(3)    $\sum_{k=1}^{s} a_{jk}^{2} = 1$ .

Then each of the $s$ cell entries $a_{jk}$ in each row $j$ can be expressed
linearly in terms of $r$ independent cell entries where $r \le s$. There is no
loss of generality in rearranging the rows and corresponding columns so that
the $r$ independent columns become the first $r$ columns of the matrix. Then
we have

(4)    $a_{jk} = A_{1k}a_{j1} + A_{2k}a_{j2} + \dots + A_{rk}a_{jr}$ .

This linear description of $a_{jk}$ can be condensed into the form

(5)    $a_{jk} = \sum_{m=1}^{r} A_{mk}a_{jm}$ .

Squaring, we have

(6)    $a_{jk}^{2} = A_{1k}^{2}a_{j1}^{2} + A_{2k}^{2}a_{j2}^{2} + \dots + A_{rk}^{2}a_{jr}^{2} + 2A_{1k}A_{2k}a_{j1}a_{j2} + \dots$ .

This equation can be condensed as follows:

(7)    $a_{jk}^{2} = \sum_{m=1}^{r}\sum_{m'=1}^{r} A_{mk}A_{m'k}\,a_{jm}a_{jm'}$ .

Summing (7) over the $s$ columns gives, by (3),

(8)    $\sum_{k=1}^{s}\sum_{m=1}^{r}\sum_{m'=1}^{r} A_{mk}A_{m'k}\,a_{jm}a_{jm'} = 1$ .

Equation (8) can be regarded as an equation in the $r$ unknowns $a_{jm}$ where
$m = 1, 2, \dots, r$. Equation (8) represents a hyperellipsoid in the space of
$r$ dimensions which is defined by the $r$ columns of the given $n \times s$
matrix. The hyperellipsoid of (8) contains the $n$ points represented by the
$n$ rows of $Q$. Hence we have the following:

Theorem: If a matrix Q of order nXs is of rank r and if it has been normal- 
ized by rows, then any set of r independent columns of Q define the r 
Cartesian co-ordinates of each of n points which lie in the surface of a 
hyperellipsoid in a space of r dimensions with center at origin. 
Specializing this theorem to rank 2, we have 

Theorem: If a matrix Q of order nXs is of rank 2 and if it has been nor- 
malized by rows, then any pair of independent columns determines the 
Cartesian co-ordinates of each of n points on an ellipse with center at 
origin. 
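
The rank-2 form of the theorem can be verified on a small constructed example. The loadings below are hypothetical, and the sketch assumes that the first two columns of the normalized matrix are independent. It solves for the coefficients that express every column in terms of the first two, forms the quadratic form that results from summing the squared columns, and checks that each row satisfies it.

```python
import math

# Hypothetical loadings of six variables on two common factors.
LOADINGS = [(0.8, 0.0), (0.7, 0.4), (0.2, 0.6), (0.3, 0.7), (0.9, 0.1), (0.5, 0.5)]

def normalized_rank2_matrix(loadings):
    """Rows of the rank-2 matrix of scalar products, each scaled to unit length."""
    rows = [[a1 * b1 + a2 * b2 for (b1, b2) in loadings] for (a1, a2) in loadings]
    return [[x / math.sqrt(sum(v * v for v in row)) for x in row] for row in rows]

def column_coefficients(q):
    """Coefficients (A1k, A2k) with column k = A1k * column 0 + A2k * column 1."""
    (x0, y0), (x1, y1) = q[0][:2], q[1][:2]
    det = x0 * y1 - x1 * y0
    coeffs = []
    for k in range(len(q[0])):
        a1 = (q[0][k] * y1 - q[1][k] * y0) / det
        a2 = (x0 * q[1][k] - x1 * q[0][k]) / det
        coeffs.append((a1, a2))
    return coeffs

def ellipse_check(q):
    """Return the conic coefficients and the residual of each row on the conic."""
    coeffs = column_coefficients(q)
    m11 = sum(a1 * a1 for a1, _ in coeffs)
    m12 = sum(a1 * a2 for a1, a2 in coeffs)
    m22 = sum(a2 * a2 for _, a2 in coeffs)
    residuals = [m11 * x * x + 2.0 * m12 * x * y + m22 * y * y - 1.0
                 for x, y, *_ in q]
    return (m11, m12, m22), residuals

Q = normalized_rank2_matrix(LOADINGS)
(m11, m12, m22), residuals = ellipse_check(Q)

print(max(abs(r) for r in residuals))   # vanishes within rounding error
print(m11 * m22 - m12 * m12 > 0)        # the conic is an ellipse
```
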

One example of this theorem will be shown for rank 2. Table (10-iii) shows
the intercorrelations of eight fictitious variables whose correlation matrix is 

Table 6

        1         2         3         4         5         6         7         8      $\sum a^2$
1    .458079   .400819   .114520   .171780   .515338   .458079   .286299   .171780   1.000000
2    .336733   .390851   .228497   .294641   .402877   .432942   .378824   .318694   1.000000
3    .138219   .328270   .345547   .414657   .207328   .345547   .449212   .466489   1.000000
4    .166306   .339542   .332612   .401907   .235600   .360330   .443483   .450412   1.000000
5    .431563   .401594   .143854   .203794   .491503   .455539   .311685   .209788   1.000000
6    .349565   .393261   .218478   .284022   .415108   .436956   .371413   .305869   1.000000
7    .230121   .362440   .299157   .368193   .299157   .391205   .425723   .408464   1.000000
8    .151015   .333492   .339784   .408999   .220230   .352369   .446753   .459338   1.000000



of rank 2. When the table is normalized by rows, the cell entries take the
values shown in Table 6. The first two columns are plotted in Figure 1, and 
it is seen that the points determine an ellipse with center at origin. 

A method of maximizing the number of zero factor loadings (the single 
hyperplane method) 

When the factorial matrix F has been obtained, it is desirable to be able 
to extract the primary abilities by methods that are not dependent on any 
hypotheses concerning their nature. The primary abilities have been
defined as those factors which reduce to a minimum the number of factors per
test that will account for the intercorrelations. Since the primary abilities 
are likely to be positively correlated in all readily available experimental 




FIGURE 1

groups of subjects, the methods of isolating the abilities must be free from 
the restrictions of orthogonality. The simplest underlying structure is indi- 
cated by a transformation G of F by which the number of vanishing entries 
in V is maximized. A large number of zero entries in each column p of V 
constitutes assurance of an underlying order among the variables whereby 
each one of them can be described by a number of scientific categories that
is smaller than the rank of the correlational matrix. In general, such a
transformation does not exist for a factorial matrix that is produced with
arbitrary cell entries. Geometrically, this problem can be described as an
attempt to discover a set of $r$ hyperplanes so defined that each of the $n$
test vectors lies in one or more of the hyperplanes. Since each hyperplane is
of $(r-1)$ dimensions, it is clear that one primary ability is absent from all
test vectors in each hyperplane. If every test vector is so contained, it
follows that there will be at least one vanishing entry in each row $j$ of $V$.

In the extraction of primary abilities, each of the r hyperplanes will be 
sought in succession. This is feasible, since there are no conditions govern- 
ing the relations between the hyperplanes or between the primary trait 
vectors except that they be distinct. The method to be described consists 
in finding a hyperplane that will contain as many as possible of the test 
vectors. The fact that a test is contained in a hyperplane can also be re- 
garded as a zero correlation between the test and the normal to the hyper- 
plane. This normal can be thought of as an imaginary test. It is desired, 
then, to find a vector $\Lambda_p$ in the common-factor space with which the
maximum number of tests have zero correlation or for which the number of zero
correlations is larger than for any neighboring vector. 

Since the correlation coefficient is a continuous function of the angular 
separation of the test vector and the reference vector $\Lambda_p$, it is desirable to
maximize, not the absolute number of tests whose correlations with $\Lambda_p$
vanish exactly, but rather some function of this correlation that has a large 
value when the correlation is near zero and which has insignificant values 
when the correlation becomes appreciable. This should be a function of the 
square of the correlation in order that the function be symmetric. 

There is a very large number of functions of $r_{j\Lambda}$ which can be used
for the present purpose, but many of them must be eliminated for statistical
or computational reasons. Let $w$ represent the function of $r_{j\Lambda}$,
and let

(9)    $u \equiv \sum_{j=1}^{n} w_{jp}$

be the function which is to be maximized in order to insure a large number
of vanishing entries in a column $p$ of $V$.

One of the simplest functions that satisfy the demands of this problem is

(10)    $w_{jp} = \dfrac{1}{r_{j\Lambda}^{2}}$ ;

but this function has the statistical limitation that when the correlation
vanishes, $w$ becomes infinite, so that it cannot be handled computationally.
This defect can be remedied by modifying the function to

(11)    $w_{jp} = \dfrac{1}{r_{j\Lambda}^{2} + c}$ ,

where $c$ is some small arbitrary constant such as +.01. The maximum value
of $w$ is then 100 when the correlation vanishes, and it is nearly unity when
the correlation is unity.
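
The behavior of the weighting function is easily tabulated. The following minimal sketch evaluates the function of (11) with $c = .01$ and the sum of (9); the names `weight` and `criterion` are illustrative.

```python
def weight(correlation, c=0.01):
    """Weight w = 1 / (r^2 + c) of one test for a trial reference vector."""
    return 1.0 / (correlation ** 2 + c)

def criterion(correlations, c=0.01):
    """The sum u of the weights over all tests, as in equation (9)."""
    return sum(weight(r, c) for r in correlations)

for r in (0.0, 0.1, 0.3, 1.0):
    print(r, round(weight(r), 2))

# Two tests lying in the hyperplane and one far from it.
print(round(criterion([0.0, 0.0, 1.0]), 2))
```

The weight falls from 100 at a vanishing correlation to 50 at a correlation of .10 and to 10 at a correlation of .30, so that only tests lying close to the hyperplane contribute appreciably to $u$.
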

It seems certain that better functions and simpler computational meth- 
ods will be found than those which are to be described here. The excuse for 
presenting a method which is not yet the simplest is that it does locate each 
column of V in which the number of zero entries is maximized. While this 
method is useful, it cannot be applied automatically because hyperplanes 
may exist in a particular system which contain groups of traits but which 
do not define the most appropriate scientific categories. 

If it is postulated that all of the entries in V shall be positive or zero and 
that negative factor loadings are to be excluded, then the precaution must 
be taken to carry the factorial matrix F to a sufficient number of factors. 
If this is not done, negative factor loadings will appear in V even though 
the tests can be described in the positive region of a common-factor space 
in higher dimensions than those which are assumed in F. This is largely a 
question of judgment as to when the residuals are small enough to be ig- 
nored. There should be no harm in carrying the factorial matrix to a num- 
ber of columns larger than needed. 

Since $r_{j\Lambda} = v_{jp}$, we have

(12)    $w_{jp} = \dfrac{1}{v_{jp}^{2} + c}$ .

It is desired to find the vector $\Lambda_p$ for which $u$ is maximized. The
vector $\Lambda_p$ is defined in terms of its direction cosines, which may be
denoted $\lambda_{1p}, \lambda_{2p}, \dots, \lambda_{rp}$. Then

(13)    $F_j\Lambda_p = v_{jp} = a_{j1}\lambda_{1p} + a_{j2}\lambda_{2p} + \dots + a_{jr}\lambda_{rp}$ .

The unknown parameters are the direction cosines of $\Lambda_p$, while the
values of $a_{jm}$ are given in the matrix $F$. The normal equations would be
of the form



(14)    $\dfrac{\partial u}{\partial \lambda_{mp}} + 2\beta_p\lambda_{mp} = 0$

with the conditional equation

(15)    $\sum_{m=1}^{r} \lambda_{mp}^{2} = 1$ ,

where $\beta_p$ is a Lagrange multiplier.* These $r$ normal equations are
evidently non-linear and awkward to solve.

Since the direct solution of the $r$ simultaneous equations (14) is not
feasible, the solution will be reached by starting with a trial vector. This
vector will be adjusted until the normal equations are satisfied. Let
$\Lambda_p$ be an arbitrary trial vector. Then the $n$ values of $v_{jp}$ in
(13) may be determined for the trial vector. It is desired to maximize $u$.
One of the parameters may be expressed in terms of the remaining ones by the
conditional equation (15), so that the solution may be found in terms of
$(r-1)$ independent parameters. Since the numbering of the orthogonal
reference vectors is arbitrary, let $\lambda_{1p}$ be expressed in terms of
the remaining $(r-1)$ direction cosines of $\Lambda_p$. By (15) we have

(16)    $\lambda_{1p}^{2} = 1 - \sum_{m=2}^{r} \lambda_{mp}^{2}$ ,

so that

(17)    $\lambda_{1p} = \left[ 1 - \sum_{m=2}^{r} \lambda_{mp}^{2} \right]^{1/2}$ .

The first partial derivative of $v_{jp}$ with respect to the independent
parameter $\lambda_{mp}$ is then

(18)    $\dfrac{\partial v_{jp}}{\partial \lambda_{mp}} = -a_{j1}\dfrac{\lambda_{mp}}{\lambda_{1p}} + a_{jm}$ .

Also,

(19)    $\dfrac{\partial w_{jp}}{\partial v_{jp}} = -\dfrac{2v_{jp}}{(v_{jp}^{2} + c)^{2}}$ .

* An alternative method of successive approximation is to treat (14) as $r$
linear equations in $\beta_p$ for $r$ trial values of $\lambda_{mp}$ with
residuals $\rho_{mp}$. For each approximation $\lambda_{mp}$ a new trial value
$\beta_p$ is determined so as to minimize the $r$ residuals $\rho_{mp}$. These
vanish for those values of $\lambda_{mp}$ which maximize the function $u$.
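From (18) and (19) we have

(20)    $\dfrac{\partial w_{jp}}{\partial \lambda_{mp}} = \dfrac{\partial w_{jp}}{\partial v_{jp}}\,\dfrac{\partial v_{jp}}{\partial \lambda_{mp}} = \dfrac{2v_{jp}}{(v_{jp}^{2} + c)^{2}} \left( a_{j1}\dfrac{\lambda_{mp}}{\lambda_{1p}} - a_{jm} \right)$ .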

Let 



* 

Then 
(22) 



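Since $u$ is the function that is to be maximized, its partial derivatives
with respect to the $(r-1)$ independent parameters must be found. These
derivatives are in the form

(23)    $\dfrac{\partial u}{\partial \lambda_{mp}} = \dfrac{\partial}{\partial \lambda_{mp}} \sum_{j=1}^{n} w_{jp} = \sum_{j=1}^{n} \dfrac{\partial w_{jp}}{\partial \lambda_{mp}}$ .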

Summing (22) for all tests j, we have, by (23), 

n n, 

(24) 



The numerical values of the (r - 1) derivatives of (24) may be determined
for the trial vector $\Lambda_p$. Let these derivatives be denoted

(25) $p_{mp} = \dfrac{\partial u}{\partial \lambda_{mp}}$ .

Since the derivatives (24) show the rates at which the function u is increasing
at the point $\Lambda_p$ with respect to the (r - 1) independent direction cosines
of $\Lambda_p$, it is clear that the small corrections to $\lambda_{mp}$ should be
proportional to $p_{mp}$. Let the corrections be denoted

(26) $\delta_{mp} = k\,p_{mp}$ ,

where k is a small arbitrary constant. Then

(27) $\mu_{mp} = \lambda_{mp} + \delta_{mp}$ ,

where $\mu_{mp}$ are proportional to the (r - 1) independent direction cosines of
the new trial vector $M_p$. It is advisable to choose k so that none of the
corrections $\delta_{mp}$ exceed .10 or .05.




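When the (r - 1) values of $\mu_{mp}$ have been computed by (27), the remaining
value $\mu_{1p}$ is determined by the equation

(28) $\mu_{1p} = \lambda_{1p} - \dfrac{1}{\lambda_{1p}} \sum_{m=2}^{r} \lambda_{mp}\,\delta_{mp}$ ,

which is obtained by taking differentials of (16), noting that
$d\lambda_{mp} = \delta_{mp}$. Normalizing the r values $\mu_{mp}$ gives the direction
cosines of the new trial vector $M_p$.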

This procedure is repeated until the (r - 1) derivatives $p_{mp}$ all vanish.
The vector $\Lambda_p$ for which all the derivatives $p_{mp}$ vanish gives a
stationary point in the surface

(29) $u = \phi(\lambda_{1p}, \lambda_{2p}, \ldots, \lambda_{rp})$ .

It is advisable to choose for $\lambda_{1p}$ that direction cosine which has the
highest absolute value.

The r direction cosines of the vector $\Lambda_p$ for which the function u is
stationary constitute one of the columns in the transformation G. The order
of the columns of G is arbitrary.
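
The trial-and-correction procedure described above can be sketched in a few lines of code. This is only an illustrative sketch under stated assumptions: the weight $w = 1/(c + v^2)$ is assumed because it is the form that agrees with the entries of Tables 7 and 8, and the function names `u_value` and `ascent_step`, the step limit `max_corr`, and the four-point data are hypothetical choices rather than anything given in the text.

```python
import numpy as np

def u_value(F, lam, c=0.01):
    """u = sum_j w_j, with the assumed weight w_j = 1 / (c + v_j**2)."""
    v = F @ lam                          # projections v_jp of the n traits
    return float(np.sum(1.0 / (c + v**2)))

def ascent_step(F, lam, max_corr=0.05, c=0.01):
    """One correction of the trial vector lam (r direction cosines)."""
    lam = lam / np.linalg.norm(lam)
    d = int(np.argmax(np.abs(lam)))      # dependent cosine: largest in absolute value
    v = F @ lam
    y = 2.0 * v / (c + v**2) ** 2        # y = -dw/dv for the assumed weight
    # Derivatives of u with respect to the independent cosines;
    # the entry belonging to the dependent cosine d is identically zero.
    p = -(F - np.outer(F[:, d], lam / lam[d])).T @ y
    if not np.any(p):
        return lam                       # stationary point reached
    k = max_corr / np.max(np.abs(p))     # largest independent correction equals max_corr
    delta = k * p
    mu = lam + delta
    # The dependent cosine follows from the differential of the unit-length condition.
    mu[d] = lam[d] - (lam @ delta) / lam[d]
    return mu / np.linalg.norm(mu)

# Hypothetical data: four traits in two dimensions.
F = np.array([[0.8, 0.3], [0.8, 0.4], [0.3, 0.7], [0.2, 0.8]])
lam0 = np.array([1.0, 0.0])
lam1 = ascent_step(F, lam0)
print(u_value(F, lam0), u_value(F, lam1))
```

Repeating `ascent_step` with a gradually smaller `max_corr` corresponds to reducing the size of the corrections as the derivatives approach zero; the schedule for doing so is left open here.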

It will be found that the corresponding column of the matrix V shows a 
large number of vanishing entries if the variables of R have simple struc- 
ture. In the case of psychological tests, this is in accordance with the hy- 
pothesis that each test in a diversified test battery does not require all of 
the primary abilities which are required by the battery as a whole. If the 
number of columns of F is smaller than the number of primary abilities in 
the n tests and if the primary abilities are involved only positively in the 
tests, then the insufficient number of columns of F will cause negative 
entries in V. 

In evaluating the partial derivatives of (24) and in determining the value 
of u which is to be maximized, it is convenient to have facilitating tables for 
w and for y. Table 7 shows the value of w for each given value of v. Table 8 
shows the value of y for each given value of v. The argument v is listed in 
these tables to two decimals.* 
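
The facilitating tables can be regenerated mechanically. The sketch below assumes $w = 1/(c + v^2)$ and $y = 2v/(c + v^2)^2$; these forms are inferred from the printed entries (for example w = 50 and y = 500 at v = .10 when c = .01), since the defining equation lies outside this section. The helper names `w`, `y`, and `table` are hypothetical.

```python
def w(v, c=0.01):
    """Assumed weighting function: w = 1 / (c + v^2)."""
    return 1.0 / (c + v * v)

def y(v, c=0.01):
    """Assumed companion function: y = -dw/dv = 2v / (c + v^2)^2."""
    return 2.0 * v / (c + v * v) ** 2

def table(func, c, digits):
    """Ten rows (tenths of v) by ten columns (hundredths of v)."""
    rows = []
    for tenth in range(10):
        rows.append([round(func((10 * tenth + h) / 100.0, c), digits)
                     for h in range(10)])
    return rows

for row in table(w, 0.01, 0):
    print(row)
```

Calling `table(y, 0.10, 2)` in the same way gives the two-decimal layout used for the larger constant.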

* Special data sheets have been prepared for determining the hyperplanes in which
the function u is maximized. These are available at the University of Chicago
Bookstore.

Table 7
Values of w

c = .01

  v       0      1      2      3      4      5      6      7      8      9
 .00    100     99     96     92     86     80     74     67     61     55
 .10     50     45     41     37     34     31     28     26     24     22
 .20     20     18     17     16     15     14     13     12     11     11
 .30     10      9      9      8      8      8      7      7      6      6
 .40      6      6      5      5      5      5      5      4      4      4
 .50      4      4      4      3      3      3      3      3      3      3
 .60      3      3      3      2      2      2      2      2      2      2
 .70      2      2      2      2      2      2      2      2      2      2
 .80      2      2      1      1      1      1      1      1      1      1
 .90      1      1      1      1      1      1      1      1      1      1

c = .10

  v       0      1      2      3      4      5      6      7      8      9
 .00  10.00   9.99   9.96   9.91   9.84   9.76   9.65   9.53   9.40   9.25
 .10   9.09   8.92   8.74   8.55   8.36   8.16   7.96   7.76   7.55   7.35
 .20   7.14   6.94   6.74   6.54   6.35   6.15   5.97   5.78   5.61   5.43
 .30   5.26   5.10   4.94   4.79   4.64   4.49   4.36   4.22   4.09   3.97
 .40   3.85   3.73   3.62   3.51   3.41   3.31   3.21   3.12   3.03   2.94
 .50   2.86   2.78   2.70   2.63   2.55   2.48   2.42   2.35   2.29   2.23
 .60   2.17   2.12   2.06   2.01   1.96   1.91   1.87   1.82   1.78   1.74
 .70   1.69   1.66   1.62   1.58   1.54   1.51   1.48   1.44   1.41   1.38
 .80   1.35   1.32   1.29   1.27   1.24   1.22   1.19   1.17   1.14   1.12
 .90   1.10   1.08   1.06   1.04   1.02   1.00   0.98   0.96   0.94   0.93

Table 8
Values of y

c = .01

  v       0      1      2      3      4      5      6      7      8      9
 .00      0    196    370    505    595    640    649    631    595    549
 .10    500    450    403    359    320    284    252    225    200    179
 .20    160    144    129    116    105     95     86     79     72     66
 .30     60     55     51     47     43     40     37     34     32     30
 .40     28     26     24     23     21     20     19     18     17     16
 .50     15     14     13     13     12     11     11     10     10      9
 .60      9      8      8      8      7      7      7      6      6      6
 .70      6      5      5      5      5      5      4      4      4      4
 .80      4      4      4      3      3      3      3      3      3      3
 .90      3      3      3      2      2      2      2      2      2      2

c = .10

  v       0      1      2      3      4      5      6      7      8      9
 .00   0.00   2.00   3.97   5.89   7.75   9.52  11.18  12.72  14.13  15.40
 .10  16.53  17.51  18.34  19.03  19.57  19.99  20.28  20.46  20.54  20.51
 .20  20.41  20.23  19.98  19.68  19.33  18.93  18.51  18.06  17.60  17.11
 .30  16.62  16.12  15.62  15.12  14.63  14.14  13.66  13.19  12.72  12.27
 .40  11.83  11.41  11.00  10.60  10.21   9.84   9.48   9.13   8.79   8.47
 .50   8.16   7.87   7.58   7.31   7.04   6.79   6.55   6.31   6.09   5.88
 .60   5.67   5.47   5.28   5.10   4.93   4.76   4.60   4.45   4.30   4.16
 .70   4.02   3.89   3.77   3.64   3.53   3.42   3.31   3.21   3.11   3.01
 .80   2.92   2.83   2.75   2.67   2.59   2.51   2.44   2.37   2.30   2.24
 .90   2.17   2.11   2.05   2.00   1.94   1.89   1.84   1.79   1.74   1.70

Analytical method of isolating simple structure.

The equation of a simple structure has been shown to be of the form
(7-vi) or of the more condensed form (9-vi). The simple structure is defined
by the $r^2$ parameters $\lambda_{mp}$ of (7-vi). The best fitting simple structure
for a particular factorial matrix F may be defined, by the usual statistical
conventions, as that set of r co-ordinate hyperplanes $L_p$ with $r^2$ parameters
for which the function $\phi$ in (11-vi) is minimized. For each trait j, let



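(30) $w_j = \prod_{p=1}^{r} v_{jp}^{2}$ ,

so that

(31) $\phi = \sum_{j=1}^{n} w_j$ .

The elements of the oblique factorial matrix V are

(32) $v_{jp} = \sum_{m=1}^{r} a_{jm}\lambda_{mp}$ .

The r parameters $\lambda_{mp}$ for each co-ordinate hyperplane $L_p$ are subject to
the conditional equation

(33) $z_p = \sum_{m=1}^{r} \lambda_{mp}^{2} - 1 = 0$ .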

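The form of the normal equations is as follows:

(34) $\dfrac{\partial \phi}{\partial \lambda_{mp}} + \beta_p \dfrac{\partial z_p}{\partial \lambda_{mp}} = 0$ ,

where $\beta_p$ is a Lagrange multiplier for each hyperplane $L_p$. The first term of
(34) may be written in terms of $\lambda_{mp}$, namely,

(35) $\dfrac{\partial \phi}{\partial \lambda_{mp}} = \sum_{j=1}^{n} \dfrac{\partial w_j}{\partial \lambda_{mp}}$ .

Substituting (32) in (30), the partial derivatives of (30) are

(36) $\dfrac{\partial w_j}{\partial \lambda_{mp}} = 2 v_{jp} \dfrac{\partial v_{jp}}{\partial \lambda_{mp}} \prod_{q} v_{jq}^{2}$ ,

where q takes all successive integral values from 1 to r, except p. For
convenience, let

(37) $s_{jp} = v_{jp} \prod_{q} v_{jq}^{2}$ .

Also,

(38) $\dfrac{\partial v_{jp}}{\partial \lambda_{mp}} = a_{jm}$ .

Substituting (37) and (38) in (36),

(39) $\dfrac{\partial w_j}{\partial \lambda_{mp}} = 2 a_{jm} s_{jp}$ .

By (35) and (39),

(40) $\dfrac{\partial \phi}{\partial \lambda_{mp}} = 2 \sum_{j=1}^{n} a_{jm} s_{jp}$ .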

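By (33), the derivatives of the second term of (34) can also be written in
terms of $\lambda_{mp}$, namely,

(41) $\beta_p \dfrac{\partial z_p}{\partial \lambda_{mp}} = 2 \beta_p \lambda_{mp}$ .

Substituting (40) and (41) in (34),

(42) $2 \sum_{j=1}^{n} a_{jm} s_{jp} + 2 \beta_p \lambda_{mp} = 0$ .

Dividing by 2 and transposing, (42) becomes

(43) $\sum_{j=1}^{n} a_{jm} s_{jp} = -\beta_p \lambda_{mp}$ .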

If the $r^2$ parameters $\lambda_{mp}$ have been correctly chosen, the r numerical
values of the left member of (43) for each of the co-ordinate hyperplanes are
proportional to the r values of $\lambda_{mp}$. If arbitrary trial values are chosen
for the $r^2$ parameters $\lambda_{mp}$, then the normalized left members of (43)
define a new unit vector $M_p$ with r direction cosines $\mu_{mp}$. The unit vector
$M_p$ has an angular separation from $\Lambda_p$ of $\theta_p$. If it were desired to
maximize the function $\phi$, then the r unit vectors $M_p$ could probably be used
as the new trial reference vectors $\Lambda_p$. But it is desired to minimize the
function $\phi$. Hence each of the r reference vectors $\Lambda_p$ may be adjusted in
the plane of the angle $\Lambda_p O M_p$ by enlarging the angle $\theta_p$. Let the new
trial reference vectors be $N_p$ with direction cosines $\nu_{mp}$. These may be
used instead of $\Lambda_p$ in order to reduce the numerical value of the function
$\phi$. Then

(44) $\Lambda_p O M_p < N_p O M_p$ .

By successive approximation, the function $\phi$ may be reduced in numerical
value toward its minimum value $\phi_0$ at which the $r^2$ parameters $\lambda_{mp}$
define a simple structure. If the simple structure is perfect in the sense that
each trait vector is contained in one or more of the co-ordinate hyperplanes,
then each of the n values $w_j$ in (30) vanishes, and hence $\phi$ also vanishes.
In this case the new vectors $N_p$ become indeterminate, since the left members
of (43) vanish.

A method of successive approximation for isolating simple structure

Equation (43) states the condition that is to be satisfied when the correct
numerical values of the $r^2$ parameters $\lambda_{mp}$ have been found. A direct
solution is not feasible for $r^2$ parameters with as many non-linear normal
equations. For computing purposes, a method of successive approximation will
be described by which the minimum value of the function $\phi$ may be
approached with any required degree of accuracy.

The principle of the method is as follows: An arbitrary set of r reference
vectors $\Lambda_p$ is chosen for the first trial. It is convenient to choose the r
orthogonal centroid axes for the first trial. Substituting their direction
cosines in (43) gives the $r^2$ initial numerical values of the left members of
(43). There will be r such values for each of the r hyperplanes $L_p$. The r
trial vectors $\Lambda_p$ are to be adjusted so as to reduce the function $\phi$ toward
its minimum value.

Let the direction cosines of the r trial vectors $\Lambda_p$ be denoted $\lambda_{mp}$,
and let the corrections be denoted $d\lambda_{mp}$. The direction cosines of the
resultant vectors $N_p$ are proportional to

(45) $\nu_{mp} = \lambda_{mp} + d\lambda_{mp}$ .

When the vectors $N_p$ are normalized, they are reduced to unit vectors $M_p$
with direction cosines

(46) $\mu_{mp} = m\,\nu_{mp}$ .

The vectors $M_p$ are the new trial vectors.

It is desired to choose small corrections so that

(47) $\phi(\mu_{mp}) < \phi(\lambda_{mp})$ .

By successive approximation the function $\phi$ is to be minimized, subject to
the conditional equation (33). The new trial vectors $M_p$ have direction
cosines

(48) $\mu_{mp} = m(\lambda_{mp} + d\lambda_{mp})$ .

The inequality (47) will be satisfied if

(49) $d\phi(\lambda_{mp}) < 0$ ,

subject to the conditional equation

(50) $dz_p = 0$ .

By (31)

(51) $d\phi = \sum_{j=1}^{n} dw_j$ .

For convenience, let

(52) $t_{jp} = \prod_{q} v_{jq}^{2}$ ,

where q takes all integral values from 1 to r, except p. Then

(53) $dw_j = 2 \sum_{p=1}^{r} \sum_{m=1}^{r} a_{jm} v_{jp} t_{jp}\, d\lambda_{mp}$ .

But

(54) $s_{jp} = v_{jp} t_{jp}$ .

Hence

(55) $dw_j = 2 \sum_{p=1}^{r} \sum_{m=1}^{r} a_{jm} s_{jp}\, d\lambda_{mp}$ ,

and by (51)

(56) $d\phi = 2 \sum_{j=1}^{n} \sum_{p=1}^{r} \sum_{m=1}^{r} a_{jm} s_{jp}\, d\lambda_{mp}$ .

The conditional equation (50) can be expressed in terms of $\lambda_{mp}$ by (33).
Then

(57) $dz_p = 2 \sum_{m=1}^{r} \lambda_{mp}\, d\lambda_{mp} = 0$ .

Let $\lambda_{rp} \neq 0$. Then

(58) $d\lambda_{rp} = -\dfrac{1}{\lambda_{rp}} \sum_{m=1}^{r-1} \lambda_{mp}\, d\lambda_{mp}$ .

Substituting (58) in (55),

(59) $dw_j = 2 \sum_{p=1}^{r} \sum_{m=1}^{r} \Bigl( a_{jm} - a_{jr}\dfrac{\lambda_{mp}}{\lambda_{rp}} \Bigr) s_{jp}\, d\lambda_{mp}$ ,

and hence

(60) $d\phi = 2 \sum_{j=1}^{n} \sum_{p=1}^{r} \sum_{m=1}^{r} \Bigl( a_{jm} - a_{jr}\dfrac{\lambda_{mp}}{\lambda_{rp}} \Bigr) s_{jp}\, d\lambda_{mp}$ .

Here the differential $d\phi$ is expressed as a sum of $r^2$ terms, namely, r terms
for each of the r hyperplanes. Each of these terms is of the form

$2 \sum_{j=1}^{n} \Bigl( a_{jm} - a_{jr}\dfrac{\lambda_{mp}}{\lambda_{rp}} \Bigr) s_{jp}\, d\lambda_{mp}$ .

The r terms in which m = r vanish identically. The two conditions (49) and
(50) are satisfied if each $d\lambda_{mp}$ is so chosen that each of the non-vanishing
terms of (60) is made negative. This can be accomplished if each of the
corrections $d\lambda_{mp}$ (excepting $d\lambda_{rp}$) is taken with sign opposite to

$\sum_{j=1}^{n} \Bigl( a_{jm} - a_{jr}\dfrac{\lambda_{mp}}{\lambda_{rp}} \Bigr) s_{jp}$ .

When the corrections for (r - 1) direction cosines of each of the r hyperplanes
have been chosen (excepting $d\lambda_{rp}$), the remaining rth correction for each
hyperplane is determined by (58). In the method of successive approximation
which is to be described, each correction $d\lambda_{mp}$ will be taken proportional
to the corresponding term in (60) with reversed sign.

The direction cosines of the new r trial vectors $M_p$ may be determined in
the following manner. Let

(61) $c_{mp} = -\sum_{j=1}^{n} \Bigl( a_{jm} - a_{jr}\dfrac{\lambda_{mp}}{\lambda_{rp}} \Bigr) s_{jp}$ ,

where $m \neq r$. The value of $c_{rp}$ is determined by the relation

(62) $c_{rp} = -\dfrac{1}{\lambda_{rp}} \sum_{m=1}^{r-1} \lambda_{mp} c_{mp}$ .

The corrections $d\lambda_{mp}$ will be taken proportional to $c_{mp}$.

Let the r direction cosines of each of r vectors $N_p$ be proportional to

(63) $\nu_{mp} = \lambda_{mp} + k_p c_{mp}$ ,

where k is so chosen that the maximum value of any one of the $r^2$ terms
$k_p c_{mp}$ is equal to some assigned value such as .10 or .05. The new trial
vectors $M_p$ have the direction cosines

(64) $\mu_{mp} = m\,\nu_{mp}$ ,

where the constant m is so chosen that $M_p$ is a unit vector. Hence

(65) $\sum_{m=1}^{r} \mu_{mp}^{2} = 1$ .

Then by (64),

(66) $m^{2} \sum_{m=1}^{r} \nu_{mp}^{2} = 1$ ,

or

(67) $m = \dfrac{1}{\sqrt{\sum_{m=1}^{r} \nu_{mp}^{2}}}$ .

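Instead of estimating the magnitudes of the corrections for the direction
cosines of the vectors $\Lambda_p$ by choosing a maximum value for $k_p c_{mp}$, it may be
desirable to estimate the magnitude of the angular displacement between
the given trial vectors $\Lambda_p$ and the next trial vectors $M_p$. Let this angular
displacement be $\theta_p$. The angles $\theta_p$ may be determined as follows: The
cosine of the angle $\theta_p$ between the unit vectors $\Lambda_p$ and $M_p$ is

(68) $\cos\theta_p = \sum_{m=1}^{r} \lambda_{mp}\mu_{mp}$ .

By (48)

$\mu_{mp} = m(\lambda_{mp} + d\lambda_{mp})$ .

Hence

(69) $\cos\theta_p = m \sum_{m=1}^{r} \lambda_{mp}(\lambda_{mp} + d\lambda_{mp})$ ,

or

(70) $\cos\theta_p = m \sum_{m=1}^{r} \lambda_{mp}^{2} + m \sum_{m=1}^{r} \lambda_{mp}\, d\lambda_{mp}$ .

Since $\Lambda_p$ are unit vectors,

(71) $\sum_{m=1}^{r} \lambda_{mp}^{2} = 1$ ,

and hence

(72) $\cos\theta_p = m + m \sum_{m=1}^{r} \lambda_{mp}\, d\lambda_{mp}$ .

But

(57) $dz_p = 2 \sum_{m=1}^{r} \lambda_{mp}\, d\lambda_{mp} = 0$ .

Hence

(73) $\cos\theta_p = m$ ,

or, by (67),

(74) $\cos\theta_p = \dfrac{1}{\sqrt{\sum_{m=1}^{r} \nu_{mp}^{2}}}$ .

By (45)

(75) $\sum_{m=1}^{r} \nu_{mp}^{2} = \sum_{m=1}^{r} \lambda_{mp}^{2} + 2 \sum_{m=1}^{r} \lambda_{mp}\, d\lambda_{mp} + \sum_{m=1}^{r} (d\lambda_{mp})^{2}$ .

By (57) and (71)

(76) $\sum_{m=1}^{r} \nu_{mp}^{2} = 1 + \sum_{m=1}^{r} (d\lambda_{mp})^{2}$ ,

or

(77) $\cos\theta_p = \dfrac{1}{\sqrt{1 + \sum_{m=1}^{r} (d\lambda_{mp})^{2}}}$ ,

and hence

(78) $\sum_{m=1}^{r} (d\lambda_{mp})^{2} = \dfrac{1}{\cos^{2}\theta_p} - 1$ ,

which can be written in the form

(79) $\sum_{m=1}^{r} (d\lambda_{mp})^{2} = \tan^{2}\theta_p$ .

By (45) and (63) it follows that

(80) $d\lambda_{mp} = k_p c_{mp}$ ,

so that

(81) $k_p = \dfrac{\tan\theta_p}{\sqrt{\sum_{m=1}^{r} c_{mp}^{2}}}$ .

If a small angular displacement $\theta_p$ is specified, the corresponding value
of $k_p$ is determined by (81).

It is probably best to choose $\lambda_{rp}$ as that direction cosine $\lambda_{mp}$ for each
hyperplane $L_p$ for which $c_{mp}$ is the largest. The values of $c_{mp}$ vanish for
those trial values of $\lambda_{mp}$ which minimize the function $\phi(\lambda_{mp})$. Their
absolute values serve to indicate the rapidity with which the minimum value of
$\phi(\lambda_{mp})$ is being approached.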
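
Choosing the multiplier from a prescribed angular displacement can be checked numerically. The sketch below is a hedged illustration: it assumes that the correction vector is orthogonal to the unit trial vector, as the conditional equation (57) requires, so that $\cos\theta_p = m$ by (73) and the multiplier is $\tan\theta_p$ divided by the length of the correction vector. The function name `multiplier_for_angle` and the numbers are hypothetical.

```python
import numpy as np

def multiplier_for_angle(c, theta):
    """Multiplier k such that adding k*c to a unit vector orthogonal to c,
    and renormalizing, turns that vector through the angle theta (radians)."""
    return np.tan(theta) / np.linalg.norm(c)

lam = np.array([1.0, 0.0])        # unit trial vector
c = np.array([0.0, -0.2781])      # hypothetical corrections, orthogonal to lam
theta = np.radians(10.0)          # desired angular displacement

k = multiplier_for_angle(c, theta)
nu = lam + k * c                  # unnormalized new vector
mu = nu / np.linalg.norm(nu)      # new unit trial vector
angle = np.arccos(lam @ mu)
print(np.degrees(angle))
```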

Numerical example of method of successive approximation

The method of successive approximation for determining a simple structure
will be illustrated by a numerical example of four points in a plane.
The four points are shown in Figure 2. They were arranged in two groups
of two points in each group. The two reference vectors $\Lambda_p$ of the
best-fitting simple structure will then necessarily be orthogonal to radial
lines that pass through or near the two groups. Each trial consists of
computations that are illustrated by Tables 9 and 10 for the first trial. All
of the trials are summarized in Table 11.

In Table 9 the four points are numbered in column j. The two co-ordinates
$a_{j1}$ and $a_{j2}$ for each of the four points are shown in the next two
columns. The values of $v_{jp}$ are shown in the next two columns. Since the first
trial vectors $\Lambda_p$ are taken as unit vectors along the two orthogonal reference
axes, the initial values of $a_{jm}$ and $v_{jp}$ are identical. The values of $a_{jm}$
and $v_{jp}$ are different in all subsequent trials. The resulting values of $w_j$
are shown in the next column. The sum of this column is the initial value of the
function $\phi(\lambda_{mp})$ which is to be minimized. The next two columns for
$s_{jp}$ facilitate computation of the values of $c_{mp}$. The last column is a
check column.

Table 9

  j    a_j1    a_j2    v_j1    v_j2     w_j      s_j1     s_j2
  1     .8      .3      .8      .3     .0576    .0720    .1920
  2     .8      .4      .8      .4     .1024    .1280    .2560
  3     .3      .7      .3      .7     .0441    .1470    .0630
  4     .2      .8      .2      .8     .0256    .1280    .0320
 Sum                                   .2297

The first section of Table 10 shows the numerical values of the left members
of (43). They are obtained directly from Table 9. The section $c_{mp}$ was
computed by (61) and (62). The next section ($d\lambda_{mp}$) was computed with such
a multiplier k that the maximum correction was equal to an assigned value which
was reduced for each trial. In the first trial this maximum value of the
correction was .25, and it was denoted e. The next two sections are
self-explanatory. The numerical values of the direction cosines of the two new
trial vectors $M_p$ are recorded in Table 11.



Table 10

Left members of (43)
               p = 1          p = 2
   m = 1     .22970000      .38370000
   m = 2     .27810000      .22970000
   Sum       .50780000      .61340000

$c_{mp}$
   m = 1     .00000000     -.38370000
   m = 2    -.27810000      .00000000

$d\lambda_{mp} = k_p c_{mp}$
   m = 1     .00000000     -.25000000
   m = 2    -.18119625      .00000000      $k_p$ = +.65155069

$\nu_{mp}$
   m = 1    1.00000000     -.25000000
   m = 2    -.18119625     1.00000000

$\mu_{mp}$
   m = 1     .98397744     -.24253562      $m_1$ = .98397744
   m = 2    -.17829302      .97014250      $m_2$ = .97014250

Table 11

 Trial     $\phi$          e        $\lambda_{11}$    $\lambda_{21}$    $\lambda_{12}$    $\lambda_{22}$
   1     .22970000   .25000000   1.00000000    .00000000    .00000000   1.00000000
   2     .03658568   .14439991    .98397744   -.17829302   -.24253562    .97014250
   3     .00534688   .03000000    .96391815   -.26619880   -.38271927    .92386469
   4     .00450116   .01000000    .95544072   -.29518304   -.41250184    .91095677
   5     .00442401   .00100000    .95450906   -.29818189   -.40247759    .91542984
   6     .00442241   .00070000    .95419615   -.29918173   -.40314454    .91513632
   7     .00442213                .95397641   -.29988165   -.40320642    .91510906

The new trial reference vectors reduce the function $\phi$. In the second trial
k is taken as unity, since the maximum value of $c_{mp}$ is of the right order of
magnitude. The maximum correction is then .144400. Table 11 shows the
maximum correction e for the direction cosines of the trial reference vectors,
the resulting reference vectors, and the corresponding reduced value of the
function $\phi$. If the maximum correction e is taken too large, the fact will be
known by the rise of the function.

This example has been carried to a higher degree of accuracy than will
be expedient in most scientific problems. When the direction cosines of the
last trial vectors are substituted in (43), it is found that they are
proportional to the left members of (43) with a maximum discrepancy which is
less than .002. This proves that the minimum value of $\phi$ is reached. It is
represented by the reference vectors $\Lambda_p$ in Figure 2.
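
The first trials of this example can be retraced with a short program. The sketch rests on several assumptions that should be read as such: the four points are taken as (.8, .3), (.8, .4), (.3, .7), (.2, .8), which is the reading of Table 9 that reproduces the tabulated sums; $w_j$ is the product of the squared projections; $s_{jp}$ is $v_{jp}$ times the product of the other squared projections; the corrections are $c_{mp} = -\sum_j (a_{jm} - a_{jr}\lambda_{mp}/\lambda_{rp}) s_{jp}$; and the dependent cosine of each vector is taken as the one largest in absolute value. The names `phi` and `next_trial` are hypothetical.

```python
import numpy as np

def phi(F, L):
    """phi = sum over traits of the product of squared projections (V = F @ L)."""
    V = F @ L
    return float(np.sum(np.prod(V**2, axis=1)))

def next_trial(F, L, max_correction=None):
    """One trial. Columns of L are the unit reference vectors; returns new ones."""
    r = L.shape[0]
    V = F @ L
    S = np.empty_like(V)
    for p in range(r):
        others = np.delete(V, p, axis=1)
        S[:, p] = V[:, p] * np.prod(others**2, axis=1)   # s_jp
    A = F.T @ S                                          # left members of (43)
    C = np.zeros_like(L)
    for p in range(r):
        d = int(np.argmax(np.abs(L[:, p])))              # dependent cosine (assumed rule)
        ratio = L[:, p] / L[d, p]
        C[:, p] = -(A[:, p] - A[d, p] * ratio)           # independent corrections; C[d, p] is 0
        C[d, p] = -(L[:, p] @ C[:, p]) / L[d, p]         # dependent correction from the constraint
    # k = 1 unless a maximum correction is assigned
    k = 1.0 if max_correction is None else max_correction / np.max(np.abs(C))
    N = L + k * C
    return N / np.linalg.norm(N, axis=0)                 # normalize each column

F = np.array([[0.8, 0.3], [0.8, 0.4], [0.3, 0.7], [0.2, 0.8]])
L1 = np.eye(2)                                 # first trial: the orthogonal axes
L2 = next_trial(F, L1, max_correction=0.25)    # first trial uses e = .25
L3 = next_trial(F, L2)                         # second trial uses k = 1

for L in (L1, L2, L3):
    # phi followed by the cosines in the column order of Table 11
    print(round(phi(F, L), 8), np.round(L.T.ravel(), 8))
```

Under these assumptions the printed rows can be compared directly with trials 1 to 3 of Table 11; later trials would need the smaller assigned corrections listed in the column e.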

Comparison of methods 

All of the methods described in this chapter have been tried on actual 
psychological test data. It seems conclusive that the best method for most 
psychological problems is the method represented by equation (12) with 
Tables 7 and 8 to facilitate the numerical work. Analytically, the method of 
(43) is the most interesting; but it is applicable with success only to a perfect 
simple structure which cannot be expected in any experimentally obtained 
data. 

The analytical method of (43) can probably be modified so as to give as
satisfactory results as that of (12) by a proper choice of the function $w_j$
in (30). In that equation the second power of $v_{jp}$ is used. It now seems
certain that a requirement of the function w(v) is that the absolute value of
its first derivative, dw/dv, must vary inversely with v except for small values
of v in the range ±.10, where the function should be more stable. This
requirement is satisfied by equation (12). The requirement would also be
satisfied by the analytical method of adjusting all of the co-ordinate
hyperplanes simultaneously if it were modified so that

(82) 1 

where n is an integer so chosen that the exponent is a small fraction. If it
is desired to avoid having the derivatives become infinite when $v_{jp}$ is zero,
that can be accomplished by adding an arbitrary constant c, so that the
function then becomes

(83) Wl - 

The theoretical interest of the analytical method in which all of the
hyperplanes are adjusted simultaneously should not be adequate reason for
accepting a solution which fails to maximize the number of vanishing entries
in V. It seems reasonably certain that the method of equation (12) by which
each hyperplane is separately adjusted does maximize the number of vanishing
entries in V.

A consideration of primary importance in the determination of simple 
structure is the fact that its essential feature is a configuration. A require- 
ment is that each trait vector shall lie close to one or more of the co-ordinate 
hyperplanes, but factorial analysis does not necessarily involve any assump- 
tion as to which hyperplane shall contain particular trait vectors. The sta- 
tistical methods become applicable only when enough of the simple-struc- 
ture configuration has been gleaned so that assumptions can be made re- 
garding the groups of projections that are to be minimized. 

One of the numerous methods that have been tried is to locate each
hyperplane so as to minimize the sum of the absolute values of the projections
$|v_{jp}|$. With slight modification this simple method is now being used
successfully.



CHAPTER VIII 
THE POSITIVE MANIFOLD 
Restrictions on the factorial matrix 

The scientific problems to which the factor methods are applied may re- 
quire different restrictions on the elements of the factorial matrix. Several 
of these restrictions have already been discussed, and additional ones will be 
described in this chapter. Some of these restrictions may be considered 
under four cases, as follows : 

1) The simplest case is that in which the factorial matrix F can be used 
as determined by the centroid method, or by any other equivalent method, 
without restrictions beyond those that are inherent in F. It is probably 
seldom that a scientific problem can be adequately solved without some re- 
strictions on the elements of F. 

2) One form of constraint that is of very general scientific interest as 
regards the factorial matrix is that of simple structure. It seems probable 
that this constraint will be almost universally imposed in order that the 
scientific interpretation of the factorial matrix shall be convincing. 

3) If the scientific problem is such that negative cell entries in F are
excluded, then we have the important case of a simple structure in which
$a_{jm} \geq 0$. This is the assumption that underlies the application of factorial
methods to the problem of isolating primary mental abilities; but the
assumption is not absolutely necessary, since ideal constructs can be devised
for a science of psychology which do not require that the cell entries of F,
or those of the oblique factorial matrix V, be positive or zero.

4) A special case of the positive entries of F is the further restriction that
each factor of F, or of V, be either completely present or completely absent
in each test. This is a case of possible interest in genetics, but it is not likely
that it will be directly applicable to scientific data without admitting a
specific variance for each variable.

The first two cases have already been discussed. The last two cases will
be described in this chapter. If all of the elements of V are positive or zero,
then each column of V is defined by a positive hyperplane so located that all
of the trait vectors which are not contained in it are on the same side of it.
If it is assumed that all of the factors have positive or zero contributions to
each variable, then all the trait vectors are in the positive region. The
bounding planes of this region are then of special interest.

Definition: The orthogonal hyperplanes which bound the positive region in
r dimensions will be called the orthogonal positive manifold.

Definition: A set of r distinct and oblique positive hyperplanes for a trait
configuration in r dimensions will be called an oblique positive
manifold.

An oblique positive manifold is not necessarily confined to the positive
region.

Definition: If the factorial matrix of the traits which are contained in a
positive hyperplane is of rank (r - 1), then the hyperplane is a bounding
hyperplane or a positive co-ordinate hyperplane.

In the oblique factorial matrix V the elements of each column are the
distances of the traits from a co-ordinate hyperplane. The factorial matrix
V, or a corresponding matrix F, for those traits that are contained in one of
the oblique co-ordinate hyperplanes is of rank (r - 1). A positive hyperplane
can easily be located so that all of the trait vectors are either contained in
it or on the same side of it, but it would not necessarily constitute a positive
co-ordinate hyperplane. The trait vectors which are contained in it may be
of rank less than (r - 1). A positive hyperplane may be determined so that
only one trait vector lies in it, and evidently it would not be a unique
co-ordinate hyperplane. However, if the rank of a factorial matrix for the
trait vectors which are contained in such a hyperplane is (r - 1), then the
hyperplane is quite likely to be scientifically significant as a positive
co-ordinate hyperplane. If, in addition, the criteria of simple structure are
satisfied, then the reference traits determined by the intersections of the r
co-ordinate hyperplanes are almost certain to be scientifically significant
categories of reference.

If a correlational matrix is of rank $r_2$ and if the factorial matrix has been
computed to $r_1$ factors where $r_1 < r_2$, and if the trait configuration can be
inscribed in the positive manifold in $r_2$ dimensions, then it may be expected
that the trait configuration in $r_1$ dimensions cannot be inscribed in the
positive manifold of $r_1$ dimensions. In such a situation it is a matter of
judgment whether the number of columns of F has been extended far enough. It is
in the nature of most scientific problems in which the factor methods are likely
to be called upon that all of the common factors of minor significance cannot
be extracted. Those common factors which contribute only slightly to the
variance of several traits cannot be differentiated with certainty from the
variable errors. With only slight representation in the traits they cannot be
identified and named with any degree of confidence. Hence it seems useless
to carry the columns of F until the residuals are comparable with the known
order of magnitude of the errors in the given correlation coefficients. Since
the computations of successive factors must stop before the residuals reach
the order of magnitude of the variable errors in the correlations, it seems
necessary to depend on judgment as to when the process should be
discontinued. The practical criterion might be adopted that the factors should
be extracted until they cease to be meaningful; but interpretation is not
feasible until the factors have been rotated, even if an orthogonal system of
reference traits is acceptable.

No complete analytical method is available for locating the positive 
manifold, but several methods of investigating it will be described. If a sim- 
ple structure is found to exist in the trait configuration, then it may also 
happen that it can be inscribed in the positive manifold. If this situation is 
discovered, the simple structure in the positive manifold is especially con- 
vincing and the oblique factorial matrix is then almost certain to be scientifi- 
cally meaningful. It may happen that the problem is of such a nature that 
simple structure is not to be expected but that the inscribing of the trait 
configuration in a positive manifold would be meaningful. Then the positive 
manifold is the means of locating a unique set of reference axes, either 
orthogonal or oblique, which may be scientifically significant. 

The elimination of negative factor loadings 

The orthogonal transformation by which a factorial matrix F may be
rotated into an orthogonal positive manifold $F_p$ contains $\tfrac{1}{2} r(r - 1)$
independent parameters, where r is the number of columns of F. Each one of
these independent parameters may be thought of as determining an angle
of rotation $\phi$ for a pair of columns of F. There are as many independent
parameters in an orthogonal transformation in r dimensions as there are
pairs of columns. If any two columns are plotted on cross-section paper,
the point that deviates farthest from the centroid may be brought into one
of the two orthogonal axes by means of a rotation. This procedure may be
continued with successive pairs of columns until all the elements of $F_p$ are
positive or zero if the trait configuration can be inscribed in the orthogonal
positive manifold.
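
A single step of this pairwise procedure is a plane rotation of two columns. The sketch below is illustrative only: the function name `rotate_pair`, the small loading matrix, and the choice of which trait to bring onto the axis are hypothetical, and the selection of the extreme point is left to the caller.

```python
import numpy as np

def rotate_pair(F, i, j, t):
    """Rotate columns i and j of F so that trait t falls on axis i."""
    angle = np.arctan2(F[t, j], F[t, i])   # angle of rotation for this pair of columns
    c, s = np.cos(angle), np.sin(angle)
    G = F.astype(float).copy()
    G[:, i] = c * F[:, i] + s * F[:, j]
    G[:, j] = -s * F[:, i] + c * F[:, j]
    return G

# Hypothetical loadings with one negative entry in the second column.
F = np.array([[0.7, -0.2], [0.6, 0.3], [0.5, 0.6]])
G = rotate_pair(F, 0, 1, 0)                # bring the first trait onto the first axis
print(np.round(G, 4))
```

Because the rotation is orthogonal, the communality of each trait (the sum of squares of its row) is unchanged; only the division of that variance between the two columns is altered.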

If the elements of F p are theoretically positive or zero, it is to be expected 
that the variable errors will cause the theoretical zero elements to be higher 
or lower than zero. Small negative elements may therefore be expected in 
a factorial matrix which is theoretically positive. 

The problem of finding an oblique positive manifold that will circumscribe
the trait configuration can sometimes be solved approximately by a
procedure not unlike that in which clusters are isolated. This procedure is
based on the principle that those trait vectors which lie in or near the
intersections of several of the bounding co-ordinate hyperplanes must have a
maximum number of nearly vanishing correlations with the other traits.
The procedure is to select r such traits for which the number of low or nearly
vanishing coefficients is maximized. Make a list of more than r traits with
unusually large numbers of low coefficients. Arrange them in a square
correlational matrix. Eliminate traits, as for the isolation of clusters, except
that in this case it is a set of r trait vectors with the lowest possible
intercorrelations that is sought. When this set of r trait vectors has been
selected, a trial set of co-ordinate hyperplanes may be determined by taking r
sets of (r - 1) of these relatively uncorrelated extreme traits. These r
co-ordinate hyperplanes may be adjusted by the method of maximizing the number
of vanishing projections $v_{jp}$ with due regard, in this case, also to the
negative projections which are to be eliminated or reduced to values near zero.
This method has been tried with some success, but its applicability can never be
guaranteed unless it can be safely postulated that the trait configuration
can be inscribed in a positive manifold.

It must be recalled that even if all of the original intercorrelations are 
positive or zero, it does not follow that the trait configuration can be in- 
scribed in a positive orthogonal manifold. However, if the given correlation 
coefficients are all positive or zero, or if all the negative coefficients are near 
zero, then the existence of a positive manifold, as bounding planes for the 
configuration, is a plausible hypothesis. If the given correlational matrix 
contains negative coefficients that cannot be made positive by reflection, 
then the existence of a bounding positive manifold is definitely excluded. 

One type of solution to the problem of locating an existing bounding posi- 
tive manifold which has not yet been adequately investigated is based on 
the principle that the partial correlation coefficient is represented geometri- 
cally as the cosine of a dihedral angle. Consider a reference trait vector T 
and the two planes determined by the pair of vectors T and j and the pair 
of vectors T and k, where j and k are any two trait vectors. If the cosines 
of a large number of the dihedral angles jTk are near unity or near zero, then 
the reference vector T is likely to be the intersection of a set of $(r-1)$
positive co-ordinate hyperplanes of a simple structure.

It is occasionally of some interest to determine one or more positive
hyperplanes even though they may not be bounding hyperplanes. In Figure 1
let $\Lambda$ be any trial unit vector in the common-factor space, and let u be the
trait vector that has the largest negative projection c on $\Lambda$. Let x be a vector
which is coplanar with $\Lambda$ and with u. The direction cosines of $\Lambda$ are known,
and it is desired to find the direction cosines of x, which is orthogonal to u
so that the projection of u on x is zero. The vector x will constitute the next
trial vector.
Since x is coplanar with u and $\Lambda$, the direction cosines of x can be
expressed as linear functions of the direction cosines of u and of $\Lambda$. Therefore

$$x_m = a u_m + b\lambda_m \qquad (m = 1, 2, \dots, r), \tag{1}$$

where a and b are the parameters to be determined, while $u_m$ and $\lambda_m$ are the
direction cosines of u and $\Lambda$. When the parameters a and b have been found,
they can be used to determine the direction cosines of the vector x in the
common-factor space of r dimensions which contains u and $\Lambda$.

FIGURE 1
The determination of the vector x is subject to the conditional equation

$$\sum_{m=1}^{r} x_m u_m = 0, \tag{2}$$

since x and u are statistically independent. Substituting (1) in (2),

$$\sum_{m=1}^{r} u_m (a u_m + b\lambda_m) = 0, \tag{3}$$

which, after expanding and combining terms, becomes

$$a\sum_{m=1}^{r} u_m^2 + b\sum_{m=1}^{r} u_m\lambda_m = 0. \tag{4}$$

But

$$\sum_{m=1}^{r} u_m^2 = h^2, \tag{5}$$

where $h^2$ is the communality of the trait u, and

$$\sum_{m=1}^{r} u_m\lambda_m = r_{u\Lambda}, \tag{6}$$

where $r_{u\Lambda}$ is the correlation between the trait u and the trial vector $\Lambda$.
The correlation $r_{u\Lambda}$ can also be written as a scalar product. Hence

$$r_{u\Lambda} = h\cos\phi = c. \tag{7}$$

By (5), (6), and (7) equation (4) becomes

$$a h^2 + b c = 0, \tag{8}$$

and hence

$$b = -\frac{a h^2}{c}. \tag{9}$$

Since $x_m$ are the direction cosines of the unit vector x,

$$\sum_{m=1}^{r} x_m^2 = 1. \tag{10}$$

Substituting (1) in (10),

$$a^2\sum_{m=1}^{r} u_m^2 + b^2\sum_{m=1}^{r}\lambda_m^2 + 2ab\sum_{m=1}^{r} u_m\lambda_m = 1. \tag{11}$$

Since $\Lambda$ is a unit vector, we have by (5), (6), and (7),

$$a^2h^2 + b^2 + 2abc = 1. \tag{12}$$
Substituting (9) in (12),

$$a^2h^2 + \frac{a^2h^4}{c^2} - 2a^2h^2 = 1, \tag{13}$$

and hence

$$a = \pm\frac{c}{h\sqrt{h^2 - c^2}}. \tag{14}$$

Substituting (14) in (9),

$$b = \mp\frac{h}{\sqrt{h^2 - c^2}}. \tag{15}$$

The parameters a and b are expressed in terms of the communality of u and
the negative correlation $r_{u\Lambda}$. Since c is negative, the lower signs make
both a and b positive and give x a positive projection on $\Lambda$.

Applying these two parameters to the determination of the direction
cosines $x_m$ of x, we have

$$x_m = a u_m + b\lambda_m, \tag{16}$$

by which the new trial vector x can be determined in the common-factor
space of r dimensions. The projections of the traits in the battery on x are
then found. If a significant negative projection exists, it is treated in the
same manner as $r_{u\Lambda} = c$ until the direction cosines of a positive hyperplane
have been reached. This method is quite simple in application.
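
The successive adjustment described above can be sketched in a few lines of
Python. This is only an illustration under stated assumptions: the function
names, the tolerance, the step limit, and the three trait vectors are all
hypothetical, and the sign of the root in (14) is taken so that the new vector x
keeps a positive projection on the old trial vector.

```python
import math

def next_trial_vector(u, lam):
    """Return the unit vector x = a*u + b*lam that is orthogonal to u.

    u   : co-ordinates of the trait with the largest negative projection
    lam : current trial unit vector
    Assumes u is not collinear with lam (otherwise h^2 - c^2 = 0).
    """
    h2 = sum(um * um for um in u)                 # communality h^2, eq. (5)
    c = sum(um * lm for um, lm in zip(u, lam))    # projection of u on lam, eq. (7)
    h = math.sqrt(h2)
    root = math.sqrt(h2 - c * c)
    a = -c / (h * root)   # assumed sign: keeps x on the same side as lam
    b = h / root
    return [a * um + b * lm for um, lm in zip(u, lam)]

def positive_hyperplane(traits, lam, tol=1e-9, max_steps=50):
    """Repeat the adjustment until no projection is below -tol."""
    for _ in range(max_steps):
        projections = [sum(t * l for t, l in zip(trait, lam)) for trait in traits]
        worst = min(range(len(traits)), key=projections.__getitem__)
        if projections[worst] >= -tol:
            break
        lam = next_trial_vector(traits[worst], lam)
    return lam

# Hypothetical two-factor battery of three traits and a first trial vector.
traits = [[0.8, 0.1], [0.1, 0.7], [0.6, -0.3]]
x = positive_hyperplane(traits, [0.0, 1.0])
final_projections = [sum(t * v for t, v in zip(trait, x)) for trait in traits]
```

In this small example a single step suffices, because only the third trait has
a negative projection on the first trial vector. Convergence in general is not
guaranteed, which is why the sketch carries a step limit.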

Unitary factors 

A special case of the positive manifold is that in which each reference trait
is either completely present or entirely absent in each member of the
population N. Such reference traits may be called unitary factors in the sense
that the raw scores in the dichotomous distribution of such a trait are either
+1 or 0 for each member of the population. The corresponding standard
scores in a unitary trait have only two numerical values in the population.
These two values depend on the number $N_e$ of individuals who possess the
unitary trait and on the number $M_e$ in whom the trait is absent.
It is a legitimate hypothesis that the intellectual and emotional traits of
people can be reduced eventually to genetic origin. It seems likely that at
least some human traits will be expressed in terms of unitary component
elements which may be Mendelian in character. If that is to be the eventual
outcome, then mental traits will be conceived as the resultant of a group of
genetic unitary factors; and, in this context, the complexity of a trait will
be the number of unitary factors that demonstrably contribute to the
variance of the composite trait. Let $n_j$ and $n_k$ be the number of unitary factors
that contribute to the variance of the composite traits j and k, respectively.
It is not to be expected that these unitary factors or genes have equal
importance in determining a composite trait. Hence some system of weighting
each unitary factor seems essential in expressing the total variance of a
trait j in terms of the $n_j$ unitary factors that define j.

It also seems certain that here, as elsewhere in science, the primary causes 
do not combine in the manner of a weighted sum to produce the composite 
traits but that non-linear and discontinuous functions are involved. At the 
present time little is known about the unitary factors that combine to
produce the observable human traits; little is known about the complexities,
$n_j$, of these traits and the functions by which the unitary factors combine
their effects to produce the composite traits.

It would seem unduly pessimistic to withdraw from the problem with a 
conviction that it cannot be solved. The present factorial methods are 
based on the hope that in some scientific problems, but not necessarily in 
all of them, a linear combination of factors may serve the purposes of a first 
approximation and that features of the problem will be revealed by these 
simple methods that would otherwise remain unnoticed for a long time. 
The correlation coefficient is itself a symbol of defeat in that its computation 
is an admission of ignorance about the underlying rational equation. Who 
would ever compute the correlation coefficient between the length of a pen- 
dulum and its period? It could be done by observing the period of each of 
one hundred pendulums of different lengths. But if the customary equa- 
tions were unknown, the correlational method certainly would enable us to 
establish experimentally that there is a positive relation between the length
of a pendulum and its period and that there is no relation between the 
weight of the pendulum and its period within wide limits of weight. Such 
facts would be food for speculation concerning the non-linear functions that 
describe the phenomena more accurately. The factor methods are in a simi- 
lar situation in that they will certainly be discarded eventually for each 
class of phenomena when they have served the purpose of revealing some 
of the significant relations. 

In order to illustrate a type of factor analysis that may prove significant 
in the future, the correlation between two composite traits will be considered
as a function of the unitary elements, factors, or genes, which may be re- 
garded as primary in this context. In order not to obscure the present pur- 
pose, the analysis will be made with simplifying assumptions that may ac- 
count for experimental observations only as a first approximation. 

Let the two composite traits be j and k, and let there be $n_j$ unitary
elements in j and $n_k$ elements in k. Let there be $N_e$ individuals in the
population N who possess a particular element e, and $M_e$ individuals in whom the
element is absent. Let $u_e$ be the standard score of every individual who
possesses the unit factor, and let $v_e$ be the standard score of each individual
in whom the unit factor is absent. Then, by definition,

$$N_e + M_e = N. \tag{17}$$

Since $u_e$ and $v_e$ are standard scores, $x_{ie}$,

$$\sum_{i=1}^{N} x_{ie} = N_e u_e + M_e v_e = 0, \tag{18}$$

so that

$$v_e = -\frac{N_e}{M_e}\,u_e. \tag{19}$$

For the same reason,

$$\sum_{i=1}^{N} x_{ie}^2 = N_e u_e^2 + M_e v_e^2 = N. \tag{20}$$

Substituting (19) in (20),

$$u_e^2 = \frac{N M_e}{N_e (M_e + N_e)}, \tag{21}$$

and by (17),

$$u_e^2 = \frac{M_e}{N_e}. \tag{22}$$

Let

$$p_e = \frac{N_e}{N} \qquad \text{and} \qquad q_e = \frac{M_e}{N}.$$

Then

$$u_e^2 = \frac{q_e}{p_e}. \tag{23}$$

By (23) and (19),

$$v_e^2 = \frac{p_e}{q_e}, \tag{24}$$

so that

$$u_e^2 v_e^2 = +1. \tag{25}$$

It is evident from (19) that u e and v e are of opposite sign. Then u e may 
be taken positive and v e negative, since u e is the standard score of an individ- 
ual possessing the unitary trait while v e is the standard score for its absence. 
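
As a numerical check on (19), (22), and (24), the following Python sketch
computes the two standard scores of a unitary trait and confirms that the
resulting scores have zero mean and unit variance. The function name and the
counts $N_e = 30$, $M_e = 70$ are hypothetical values chosen only for
illustration.

```python
import math

def unitary_standard_scores(n_present, n_absent):
    """Return (u_e, v_e): standard scores for presence and absence of a
    unitary trait possessed by n_present and lacked by n_absent individuals."""
    u = math.sqrt(n_absent / n_present)     # eq. (22), positive root
    v = -math.sqrt(n_present / n_absent)    # eqs. (19) and (24), negative root
    return u, v

N_e, M_e = 30, 70                            # hypothetical counts
u_e, v_e = unitary_standard_scores(N_e, M_e)

scores = [u_e] * N_e + [v_e] * M_e           # one standard score per individual
mean = sum(scores) / len(scores)
variance = sum(s * s for s in scores) / len(scores)
```

The product `u_e * v_e` equals -1 for any split of the population, which is
the signed counterpart of equation (25).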

The correlation between two composite traits j and k can be expressed
as follows. Let $d_{ji}$ be the raw deviation score of individual i in test j, and
let it be assumed that $d_{ji}$ is the sum of the standard scores of individual i in
those unitary traits which are involved in j.

Let $w_{ije} = u_{ie}$ if the unitary element e is in j and if the element e is present
in individual i.

Let $w_{ije} = v_{ie}$ if the unitary element e is in j and if the element e is absent
in individual i.

Let $w_{ije} = 0$ if the unitary element e is not in j.

Then

$$d_{ji} = \sum_{e} w_{ije}. \tag{26}$$

The standard score $s_{ji}$ is related by a constant multiplier b to the deviation
score $d_{ji}$. Hence

$$s_{ji} = b\,d_{ji}. \tag{27}$$

To determine the multiplier b, the squares of the standard scores $s_{ji}$ may
be summed. Then

$$\sum_{i=1}^{N} s_{ji}^2 = b^2\sum_{i=1}^{N}\Bigl(\sum_{e} w_{ije}\Bigr)^2 = N. \tag{28}$$

But the elements are assumed to be uncorrelated in the population. Hence

$$N = b^2\sum_{e}\sum_{i=1}^{N} w_{ije}^2. \tag{29}$$
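If an element e is in j, then $w_{ije} = x_{ie}$, where $x_{ie}$ is the standard score of
individual i in the unitary trait e. Its numerical value is either $u_{ie}$ or $v_{ie}$,
depending on whether the element e is, or is not, present in individual i. Then

$$\sum_{i=1}^{N} w_{ije}^2 = \sum_{i=1}^{N} x_{ie}^2 = N. \tag{30}$$

Substituting (30) in (29),

$$N = b^2\sum_{e} N. \tag{31}$$

The summation of the constant N over the elements e covers $n_j$ elements.
Hence

$$1 = b^2 n_j, \tag{32}$$

or

$$b = \frac{1}{\sqrt{n_j}}. \tag{33}$$

Substituting (33) in (27),

$$s_{ji} = \frac{1}{\sqrt{n_j}}\,d_{ji}, \tag{34}$$

and, by analogy,

$$s_{ki} = \frac{1}{\sqrt{n_k}}\,d_{ki}. \tag{35}$$

The correlation between the two composite traits j and k is

$$r_{jk} = \frac{1}{N}\sum_{i=1}^{N} s_{ji}s_{ki}. \tag{36}$$

Substituting (34) and (35) in (36),

$$r_{jk} = \frac{1}{N\sqrt{n_j n_k}}\sum_{i=1}^{N} d_{ji}d_{ki}. \tag{37}$$

Substituting (26) in (37), and ignoring vanishing cross products,

$$r_{jk} = \frac{1}{N\sqrt{n_j n_k}}\sum_{e}\sum_{i=1}^{N} w_{ije}w_{ike}. \tag{38}$$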
The product $w_{ije}w_{ike}$ is equal to $u_{ie}^2$ if e is in both j and k and if e is present
in i. It is equal to $v_{ie}^2$ if e is in both j and k and if e is absent in i. It vanishes
if e is absent in j or in k or in both. The cross products vanish because the
elements are assumed to be uncorrelated. Since $u_{ie}$ and $v_{ie}$ are both
standard scores,

$$\sum_{i=1}^{N} w_{ije}w_{ike} = N \tag{39}$$

if e is in both j and k. Then

$$r_{jk} = \frac{1}{N\sqrt{n_j n_k}}\sum_{e} N. \tag{40}$$

The summation of the constant N is here over the elements that are
common to j and k. Hence

$$r_{jk} = \frac{n_{jk}}{\sqrt{n_j n_k}}, \tag{41}$$

where $n_{jk}$ is the number of elements that are common to j and k.

This well-known formula for the correlation coefficient expresses the
correlation in terms of the number of unitary elements $n_j$ that are involved in
the composite trait j, the number of unitary elements $n_k$ in k, and the
number of unitary elements $n_{jk}$ which are common to j and k.

In case the two composite traits j and k are of equal complexity as regards
the unitary factors, so that $n_j = n_k$, then the formula reduces to the
still simpler form

$$r_{jk} = \frac{n_{jk}}{n_j}, \tag{42}$$

in which the correlation coefficient is interpreted directly as the ratio of
common elements in j and k.

Equations (41) and (42) must be interpreted in the light of the simplifying 
assumptions that the unitary elements are equally weighted in their con- 
tributions to the variance of the composite traits and that they are statis- 
tically independent as regards their incidence in the population N. 
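
Equation (41) can be checked on a small artificial population. The Python
sketch below is an illustration only: the five presence patterns, the choice of
elements assigned to each composite trait, and all function names are
assumptions. The population is built as a full cross of the patterns, so that
the elements are exactly uncorrelated and equally weighted, as the derivation
requires.

```python
import itertools
import math

def standard_scores(column):
    """Standard scores of a list of raw values (population formulas)."""
    n = len(column)
    mean = sum(column) / n
    sd = math.sqrt(sum((value - mean) ** 2 for value in column) / n)
    return [(value - mean) / sd for value in column]

def pearson(a, b):
    """Product-moment correlation between two lists of equal length."""
    sa, sb = standard_scores(a), standard_scores(b)
    return sum(p * q for p, q in zip(sa, sb)) / len(a)

# Hypothetical presence patterns (1 = present, 0 = absent) for five elements.
# Crossing them gives a population in which the elements are uncorrelated.
patterns = [(1, 0), (1, 0, 0), (1, 1, 0), (1, 0), (1, 0, 0, 0)]
population = list(itertools.product(*patterns))

# Standard score of every individual in every unitary element.
x = [standard_scores([person[e] for person in population]) for e in range(5)]

def composite(elements):
    """Deviation score d: sum of standard scores in the listed elements."""
    return [sum(x[e][i] for e in elements) for i in range(len(population))]

trait_j = composite([0, 1, 2])       # n_j = 3
trait_k = composite([1, 2, 3, 4])    # n_k = 4, elements 1 and 2 are common

observed = pearson(trait_j, trait_k)
predicted = 2 / math.sqrt(3 * 4)     # n_jk / sqrt(n_j * n_k)
```

The observed and predicted values agree although the elements differ in their
incidence in the population, since only their independence and equal weighting
enter the formula.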

A type of factor analysis may be developed from this conception of the
correlation coefficient in that the three numerical values $n_j$, $n_k$, and $n_{jk}$ are
all necessarily integral. It follows that, for a finite battery of traits with
limited complexities, the frequency distribution of correlation coefficients
must show discontinuities, and even multimodality. It is conceivable that
these multimodalities and discontinuities in the correlation coefficients may
be used in an inverse process of reasoning whereby they become the
experimental evidence for making inferences about the complexities of the
composite traits and about the number of unitary elements that the traits have
in common. This type of analysis will undoubtedly proceed by investigating
the frequency distribution of coefficients separately for each column of
a correlational matrix. These coefficients may be considered in their original
form or after correcting them for attenuation or for uniqueness.

[TABLE 1: correlation coefficients obtainable under equation (41)]

Table 1 has been prepared for the purpose of illustrating further the dis- 
creteness of the numerical values of the correlation coefficients that can be 
obtained under the assumptions of equation (41). The interpretation 
of the table can be illustrated by an example. Let two composite traits 
have complexities of 4 and 5, respectively, so that one of them is determined 
by four unitary elements and the other by five unitary elements. Then the 
only possible correlations between the two composite traits are .000, .224, 
.447, .671, and .894, depending on whether they have 0, 1, 2, 3, or 4 unitary
elements in common. 
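
The discrete values discussed in this example follow directly from equation
(41). A minimal Python sketch, with a hypothetical function name, lists the
admissible coefficients for any pair of complexities.

```python
import math

def possible_correlations(n_j, n_k):
    """All values of n_jk / sqrt(n_j * n_k), rounded to three decimals,
    for n_jk = 0, 1, ..., min(n_j, n_k) common elements."""
    return [round(n_jk / math.sqrt(n_j * n_k), 3)
            for n_jk in range(min(n_j, n_k) + 1)]

example = possible_correlations(4, 5)
```

For complexities 4 and 5 the list reproduces the five values named in the
text. Any other pair of complexities can be examined in the same way.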

But while these unitary elements may be acknowledged to be a worthy 
objective, it must not be assumed that the larger and cruder categories will 
then vanish in significance. It is still useful to speak of a man's arms and 
legs even though much is known about the hierarchy of their parts and ele- 
ments. Even if hundreds of unitary and elemental factors should eventually 
be discovered to be primary determiners of intellectual endowment, it 
might still be useful to retain such categories as verbality or visual imagery 
if they demonstrably simplify our comprehension of mental endowment. 



CHAPTER IX 

ORTHOGONAL TRANSFORMATIONS

Rotation in three dimensions

In the previous chapters the theory of multiple-factor analysis has been 
discussed, including the two cases of orthogonality and obliqueness of the 
co-ordinate axes. While it is probable that most scientific problems will 
require the more general oblique co-ordinate axes, it is always of interest 
to inquire whether the fundamental categories which are represented by the 
co-ordinate axes may be regarded as statistically independent. In this case 
the co-ordinate axes are orthogonal and the principal problem is then re- 
duced to that of finding the orthogonal transformation by which the trait 
configuration of F can be rotated into a simple structure. In investigations 
where the co-ordinate axes may be expected to be orthogonal, it is con- 
venient to deal with the rotational transformations in terms of the smallest 
possible number of parameters. A rotational transformation in a space of 
r dimensions is represented by a square matrix of order r, so that there are 
$r^2$ parameters to be determined; but these are not all independent. In this
chapter several methods will be described by which a rotational transforma- 
tion of order r may be handled in terms of independent parameters. This 
considerably reduces their number, and it avoids the inconvenience of han- 
dling conditional equations. The principles will be described first for a rota- 
tional transformation in three dimensions and in four dimensions; but the 
methods are entirely general, so that they may be applied in a space of any 
number of dimensions. 

Let the given co-ordinates of the points a be $a_1$, $a_2$, $a_3$, and let these points
be subjected to a rotation. Let the new co-ordinates of the same points be
$A_1$, $A_2$, $A_3$. The change from one set of co-ordinates to the other can be
described by the orthogonal transformation

$$\begin{aligned}
A_1 &= a_1x_{11} + a_2x_{21} + a_3x_{31},\\
A_2 &= a_1x_{12} + a_2x_{22} + a_3x_{32},\\
A_3 &= a_1x_{13} + a_2x_{23} + a_3x_{33}.
\end{aligned} \tag{1}$$

Here $A_1$, $A_2$, $A_3$ represent the new co-ordinates of a point a, while the given
co-ordinates of the same point are $a_1$, $a_2$, $a_3$. The nine x values constitute
the nine parameters which define the orthogonal transformation.
The transformation may be represented in matrix form more briefly thus:

$$A = aX, \tag{2}$$

where X is an orthogonal matrix. The determinant of the third-order
matrix must be +1, since the present problem concerns only rotation without
reflection. Hence

$$\begin{vmatrix}
x_{11} & x_{21} & x_{31}\\
x_{12} & x_{22} & x_{32}\\
x_{13} & x_{23} & x_{33}
\end{vmatrix} = +1. \tag{3}$$

The nine parameters in this matrix are not independent. They must satisfy
the following six conditional equations:

$$\begin{aligned}
x_{11}^2 + x_{12}^2 + x_{13}^2 &= 1,\\
x_{21}^2 + x_{22}^2 + x_{23}^2 &= 1,\\
x_{31}^2 + x_{32}^2 + x_{33}^2 &= 1,
\end{aligned} \tag{4}$$

and

$$\begin{aligned}
x_{11}x_{21} + x_{12}x_{22} + x_{13}x_{23} &= 0,\\
x_{11}x_{31} + x_{12}x_{32} + x_{13}x_{33} &= 0,\\
x_{21}x_{31} + x_{22}x_{32} + x_{23}x_{33} &= 0.
\end{aligned} \tag{5}$$

With nine parameters and six conditional equations there are only three
independent parameters which determine a rotation in three dimensions.

In order to avoid the use of nine parameters with six conditional
equations, the three Eulerian angles may be used as the three independent
parameters. These are as follows:*

$$\begin{aligned}
a_1 = {}& A_1(\cos\phi\cos\psi - \sin\phi\sin\psi\cos\theta)\\
& - A_2(\cos\phi\sin\psi + \sin\phi\cos\psi\cos\theta)\\
& + A_3\sin\phi\sin\theta,
\end{aligned} \tag{6}$$

$$\begin{aligned}
a_2 = {}& A_1(\sin\phi\cos\psi + \cos\phi\sin\psi\cos\theta)\\
& - A_2(\sin\phi\sin\psi - \cos\phi\cos\psi\cos\theta)\\
& - A_3\cos\phi\sin\theta,
\end{aligned} \tag{7}$$

$$a_3 = A_1\sin\psi\sin\theta + A_2\cos\psi\sin\theta + A_3\cos\theta. \tag{8}$$

* Virgil Snyder and C. H. Sisam, Analytic Geometry of Space (New York: Henry Holt 
& Co., 1914), chap. iii. 
In this transformation there are only three parameters, namely, the three
angles $\phi$, $\psi$, and $\theta$; but the transformation is nonsymmetric. In actual
computation the three cosines might be regarded as independent parameters;
but then the three sines are dependent parameters, so that a
transformation by the Eulerian angles involves, in effect, six parameters with three
conditional equations.
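
That the Eulerian transformation satisfies the six conditional equations for
any three angles can be verified numerically. The Python sketch below assumes
the signs of equations (6) to (8) in their standard form, since the minus signs
are easily lost in print; the function names and the sample angles are
hypothetical.

```python
import math

def euler_matrix(phi, psi, theta):
    """Coefficient matrix of (6)-(8): row m gives a_m in terms of A_1, A_2, A_3."""
    cf, sf = math.cos(phi), math.sin(phi)
    cp, sp = math.cos(psi), math.sin(psi)
    ct, st = math.cos(theta), math.sin(theta)
    return [
        [cf * cp - sf * sp * ct, -(cf * sp + sf * cp * ct),  sf * st],
        [sf * cp + cf * sp * ct, -(sf * sp - cf * cp * ct), -cf * st],
        [sp * st,                 cp * st,                   ct],
    ]

def gram_error(m):
    """Largest deviation of m * m' from the identity matrix."""
    worst = 0.0
    for i in range(3):
        for j in range(3):
            dot = sum(m[i][k] * m[j][k] for k in range(3))
            worst = max(worst, abs(dot - (1.0 if i == j else 0.0)))
    return worst

def det3(m):
    """Determinant of a 3 x 3 matrix by cofactor expansion."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

E = euler_matrix(0.7, -1.2, 0.4)      # arbitrary illustrative angles
orthogonality_error = gram_error(E)
determinant = det3(E)
```

The three angles are free, yet the nine coefficients computed from them meet
the conditions (4) and (5) and have determinant +1.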

If this method is to be generalized to hyperspace, it is of interest to know
the relation between the rank of the correlational matrix and the number of
Eulerian angles, or other independent parameters, that will be needed to
determine a rotation in more than three dimensions. The number of
independent parameters for a rotation in r dimensions is $\tfrac{1}{2}r(r-1)$. A rotation
in one plane can be effected by disturbing only two columns in the factorial
matrix. The number of possible pairs of columns is $\tfrac{1}{2}r(r-1)$, and these
rotations would seem to be independent. In the special case where $r = 3$ this
gives three independent parameters such as the three Eulerian angles.
In order to determine a rotation in four dimensions, we should have six
independent parameters or Eulerian angles.
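
The count of independent parameters can be reached in two ways, which the
following Python sketch compares: by listing the pairs of columns, and by
subtracting the conditional equations from the $r^2$ cell entries. The
function names are hypothetical.

```python
from itertools import combinations

def rotation_planes(r):
    """All pairs of columns (numbered from 1) of an n x r factorial matrix."""
    return list(combinations(range(1, r + 1), 2))

def independent_parameters(r):
    """r^2 cell entries less r(r+1)/2 conditional equations
    (r unit-length conditions and r(r-1)/2 orthogonality conditions)."""
    conditions = r * (r + 1) // 2
    return r * r - conditions

counts = {r: (len(rotation_planes(r)), independent_parameters(r))
          for r in (3, 4, 5, 6)}
```

Both counts agree for every r, giving 3, 6, 10, and 15 parameters for three to
six dimensions.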

If the matrix F has rank 3, then its rotation will involve three dimen- 
sions. Since this can be effected by three independent parameters, it is 
desirable to have a transformation with not more than three parameters so 
as to avoid conditional equations. But there are other requirements that 
are more essential for convenience in computation. It is sometimes possible 
to effect a rotation of the factorial matrix on the basis of scientific hypothe- 
ses that can be tested. The fine adjustment of the rotation is in effect an 
infinitesimal rotation, and it will be convenient to have an orthogonal trans- 
formation in which the parameters become infinitesimal when the rotation 
is infinitesimal. In some situations it will also be convenient to start with 
trial values of the parameters and to solve for the corrections to these 
trial values. This can be done by means of linear simultaneous equations 
if second and higher powers of the corrections can be ignored. But that is 
feasible only if the parameters are themselves fractional (less than unity)
for any rotation. The most convenient form of orthogonal transformation 
seems to be one which satisfies the following requirements: 

1) It should be possible to generalize the orthogonal transformation to 
any number of dimensions, 

2) The parameters should become infinitesimal when the rotation is in- 
finitesimal, 

3) The parameters should be fractional for all rotations, 

4) The number of parameters should be as small as possible so as to re- 
duce to a minimum the number of conditional equations that are required 
for numerical work. 

An orthogonal transformation will be described that satisfies all of these 
requirements except that conditional equations for finite rotations are not
eliminated. The entries of an $n \times 3$ factorial matrix may be thought of as
the three co-ordinates of each of n points. Let the three columns represent 
the three axes. If such a matrix is rotated about the first axis, it is clear that 
the first co-ordinate of each point remains unchanged while the second and 
third co-ordinates are changed. This rotation can be represented by an angle 
$\alpha$ at the origin in the 2-3 plane. The transformation may be denoted X,
and it is

$$X = \begin{pmatrix}
1 & 0 & 0\\
0 & \cos\alpha & -\sin\alpha\\
0 & \sin\alpha & \cos\alpha
\end{pmatrix}. \tag{9}$$

It will be convenient to adopt another notation for the trigonometric
functions. Let

$$y_1 = \cos\alpha, \qquad x_1 = \sin\alpha.$$

Then the orthogonal transformation becomes

$$X = \begin{pmatrix}
1 & 0 & 0\\
0 & +y_1 & -x_1\\
0 & +x_1 & +y_1
\end{pmatrix}. \tag{10}$$

This rotation is represented by the matrix equation

$$b = aX, \tag{11}$$

by which the co-ordinates a are changed to the co-ordinates b.

The first rotation $\alpha$ is in the 2-3 plane, while the first co-ordinate remains
unchanged. The second rotation may be taken in the 1-3 plane, leaving
the second co-ordinate of b unchanged. We then have, by analogy,

$$Y = \begin{pmatrix}
y_2 & 0 & -x_2\\
0 & 1 & 0\\
x_2 & 0 & y_2
\end{pmatrix}. \tag{12}$$

This rotation may be written in matrix notation as

$$c = bY, \tag{13}$$

to represent the change in co-ordinates from b to c.

The third rotation is then in the 1-2 plane, which leaves the third
co-ordinate in c unchanged. It is represented by the analogous transformation

$$Z = \begin{pmatrix}
y_3 & -x_3 & 0\\
x_3 & y_3 & 0\\
0 & 0 & 1
\end{pmatrix}. \tag{14}$$

This rotation is shown in matrix notation by the transformation

$$A = cZ. \tag{15}$$

Summarizing the three rotations (11), (13), (15), we have

$$b = aX, \tag{11}$$

$$c = bY, \tag{13}$$

$$A = cZ, \tag{15}$$

from which we have

$$A = cZ = bYZ = aXYZ. \tag{16}$$

Let u be the matrix product of the three transformations. Then

$$u = XYZ, \tag{17}$$

so that

$$A = au, \tag{18}$$

where u is an orthogonal transformation which changes the co-ordinates of
the n points from a to A.
Since the transformations X, Y, and Z are orthogonal, their product is an
orthogonal transformation. The criterion of orthogonality of a matrix X
is that*

$$X^{-1} = X', \quad \text{or} \quad XX' = I. \tag{19}$$

Then a second orthogonal matrix Y satisfies

$$Y^{-1} = Y', \quad \text{or} \quad YY' = I.$$

The product

$$D = XY.$$

Then

$$D' = Y'X'$$

and

$$DD' = XYY'X'.$$

But $YY' = I$. Hence $DD' = XX'$. But $XX' = I$. Hence

$$DD' = I.$$

By the same reasoning the matrix u can be shown to be orthogonal. With
real parameters in X, Y, and Z, it is clear that several successive rotations
must give a rotation as their product.
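
The argument can be followed numerically with three plane rotations. The
Python sketch below is illustrative only: the helper names and the three angles
are hypothetical, and the placement of the negative sine above the diagonal is
an assumption taken from the pattern of the plane rotations in this chapter.

```python
import math

def plane_rotation(size, p, q, angle):
    """Identity matrix altered in the plane of the zero-based axes p < q:
    y = cos(angle) in cells (p, p) and (q, q), -x = -sin(angle) in cell
    (p, q), and +x in cell (q, p)."""
    m = [[1.0 if i == j else 0.0 for j in range(size)] for i in range(size)]
    x, y = math.sin(angle), math.cos(angle)
    m[p][p], m[p][q], m[q][p], m[q][q] = y, -x, x, y
    return m

def matmul(a, b):
    """Row-by-column product of two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(m):
    return [list(row) for row in zip(*m)]

def max_abs_diff(a, b):
    return max(abs(a[i][j] - b[i][j])
               for i in range(len(a)) for j in range(len(a[0])))

alpha = (0.4, -0.7, 1.1)                  # arbitrary illustrative angles
X = plane_rotation(3, 1, 2, alpha[0])     # rotation in the 2-3 plane
Y = plane_rotation(3, 0, 2, alpha[1])     # rotation in the 1-3 plane
Z = plane_rotation(3, 0, 1, alpha[2])     # rotation in the 1-2 plane
u = matmul(matmul(X, Y), Z)               # u = XYZ, equation (17)

identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
orthogonality_error = max_abs_diff(matmul(u, transpose(u)), identity)

# A row of co-ordinates a is carried to A = a u; its length is unchanged.
a = [[0.6, -0.2, 0.5]]
A = matmul(a, u)
```

Under these assumptions the first row of u begins with $y_2y_3$ and ends with
$-x_2$, and the sum of squares of the co-ordinates is the same before and
after rotation.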

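The row-by-column multiplication of the matrices $XYZ = u$, in that
order, gives the transformation

$$u = \begin{pmatrix}
y_2y_3 & -y_2x_3 & -x_2\\
y_1x_3 - y_3x_1x_2 & y_1y_3 + x_1x_2x_3 & -y_2x_1\\
x_1x_3 + y_1y_3x_2 & y_3x_1 - y_1x_2x_3 & y_1y_2
\end{pmatrix}. \tag{20}$$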



* H. W. Turnbull and A. C. Aitken, An Introduction to the Theory of Canonical Matrices
(London and Glasgow: Blackie & Son, 1932), p. 33. 
This orthogonal transformation satisfies the requirements in that it can
readily be generalized to any number of dimensions. Its parameters are all
fractional, since they represent sines and cosines of the successive angles
of rotation. For infinitesimal rotations the sines become infinitesimal, so
that second powers in the x-parameters can be neglected. If we suppress
the terms of second degree in the x's, we have

$$y = \sqrt{1 - x^2} = 1,$$

so that the transformation takes the form

$$\begin{pmatrix}
1 & -x_3 & -x_2\\
x_3 & 1 & -x_1\\
x_2 & x_1 & 1
\end{pmatrix} \tag{21}$$

for infinitesimal rotations. Apart from its unit diagonal this is a
skew-symmetric matrix, and it is of some interest to note that an infinitesimal
orthogonal transformation seems always to take this form quite irrespective of
the many alternative ways in which the finite rotation may be described. This
generalization concerning infinitesimal orthogonal transformations seems also
to hold for higher dimensions.

Finally, when the successive angles of rotation vanish, the respective 
sines vanish, the x-parameters vanish, and the transformation (20) reduces 
to the identity matrix. This is, of course, what one should expect. 

For some purposes the skew-symmetric form (21) may be useful with a 
rotational criterion. When the x-parameters of (21) have been determined, 
they may be substituted in the orthogonal transformation (20) with
assurance that the trait configuration will not be disturbed. The resulting
factorial matrix can be subjected again to a rotation by the same criterion,
estimating the parameters by (21) and rotating by (20). For some problems it
may be best to retain all terms of second degree in the x-parameters of (20). 
In this case the third and higher powers may be ignored. 

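In the transformation (20) the y-parameters may be expressed in terms
of the x-parameters. We have then

$$y = \sqrt{1 - x^2}.$$

Expanding and ignoring terms of third and higher degree,

$$y_2y_3 = 1 - \frac{x_2^2 + x_3^2}{2}.$$

Proceeding in the same manner for the other cells of (20), it takes the form

$$\begin{pmatrix}
1 - \dfrac{x_2^2 + x_3^2}{2} & -x_3 & -x_2\\[2mm]
x_3 - x_1x_2 & 1 - \dfrac{x_1^2 + x_3^2}{2} & -x_1\\[2mm]
x_2 + x_1x_3 & x_1 - x_2x_3 & 1 - \dfrac{x_1^2 + x_2^2}{2}
\end{pmatrix}. \tag{22}$$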

This transformation is obtained by ignoring terms of third and higher de- 
gree, while (21) is obtained by ignoring terms of second and higher degree. 
Forms like (21) and (22) may be used to estimate the numerical values of 
the parameters. The actual rotation can be effected by an orthogonal trans- 
formation (20) with the parameters so determined. 
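
The practical difference between the first-degree and the second-degree forms
can be seen by comparing each with the exact product for small parameters. The
Python sketch below is an illustration under assumptions: the exact cells are
those of the product XYZ with the negative sine placed above the diagonal in
each plane rotation, and the parameter values and function names are
hypothetical.

```python
import math

def exact_u(x1, x2, x3):
    """Cells of the product XYZ with y = sqrt(1 - x^2) for each parameter."""
    y1, y2, y3 = (math.sqrt(1.0 - x * x) for x in (x1, x2, x3))
    return [
        [y2 * y3,                 -y2 * x3,                 -x2],
        [y1 * x3 - y3 * x1 * x2,   y1 * y3 + x1 * x2 * x3,  -y2 * x1],
        [x1 * x3 + y1 * y3 * x2,   y3 * x1 - y1 * x2 * x3,   y1 * y2],
    ]

def first_degree(x1, x2, x3):
    """Terms of second and higher degree suppressed."""
    return [[1.0, -x3, -x2], [x3, 1.0, -x1], [x2, x1, 1.0]]

def second_degree(x1, x2, x3):
    """Terms of third and higher degree suppressed."""
    return [
        [1.0 - (x2 * x2 + x3 * x3) / 2, -x3, -x2],
        [x3 - x1 * x2, 1.0 - (x1 * x1 + x3 * x3) / 2, -x1],
        [x2 + x1 * x3, x1 - x2 * x3, 1.0 - (x1 * x1 + x2 * x2) / 2],
    ]

def max_abs_diff(a, b):
    return max(abs(a[i][j] - b[i][j]) for i in range(3) for j in range(3))

params = (0.01, 0.02, 0.03)               # small illustrative parameters
error_first = max_abs_diff(exact_u(*params), first_degree(*params))
error_second = max_abs_diff(exact_u(*params), second_degree(*params))
```

With parameters of a few hundredths, the first-degree form is in error in the
fourth decimal place and the second-degree form in the fifth, which suggests
why the shorter form may serve for estimating the parameters while the exact
transformation is kept for the rotation itself.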

Alternative transformations 

The orthogonal transformation (20) is not unique. Other orthogonal 
transformations may be used, but the one that has been described may 
satisfy best the requirements that seem to be indicated for the factor prob- 
lem. Among the various possible orthogonal transformations that have been 
investigated there may be mentioned the following, namely,

(23)    [matrix not legible]

which can be generalized to hyperspace. The parameters are not necessarily
fractional for finite rotations, and this would constitute a handicap in some 
forms of manipulation. 
One type of orthogonal transformation of special interest has been
mentioned by Professor E. B. Wilson.* An interesting characteristic of this
transformation is that the parameters are all rational and independent. It
can be generalized to hyperspace. For a rotation in three dimensions it
takes the following form:

$$\frac{1}{c}\begin{pmatrix}
1 + p^2 - q^2 - r^2 & 2pq - 2r & 2pr + 2q\\
2pq + 2r & 1 - p^2 + q^2 - r^2 & 2qr - 2p\\
2pr - 2q & 2qr + 2p & 1 - p^2 - q^2 + r^2
\end{pmatrix}, \tag{24}$$

where $c = 1 + p^2 + q^2 + r^2$, and $\tan^2(\theta/2) = p^2 + q^2 + r^2$, while $\theta$ is the
angle of rotation about an axis l. The direction cosines of l are proportional to
p, q, r. This would probably be the best form of orthogonal transformation
for the factor problem except for the fact that the parameters are not
necessarily fractional for finite rotations. (Consider for example $\theta = \pi$ in (24).)
Fractional parameters are convenient for some computing purposes in which
second and higher powers of the parameters are to be ignored. Again, it may
be desirable in some computations to start with trial values of the
parameters and to solve for a small correction for each parameter. In order to be
able to work with linear normal equations it is necessary to be able to
ignore second and higher powers of the corrections. These considerations
would lead one to prefer a transformation in which the parameters are
fractional by definition. However, transformation (24) may be used with
a suitable multiplier. This device could also be used on transformation
(23), but such possibilities have not yet been investigated.

Ignoring the terms of second degree in (24), the transformation reduces
to the form

$$\begin{pmatrix}
1 & -2r & +2q\\
+2r & 1 & -2p\\
-2q & +2p & 1
\end{pmatrix}, \tag{25}$$

which, apart from its unit diagonal, is again a skew-symmetric matrix.
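
The properties claimed for the rational form (24) can be checked numerically.
The Python sketch below is an illustration only; the function names and the
sample values of p, q, r are hypothetical, and one parameter is deliberately
taken greater than unity to show that the matrix remains orthogonal when the
parameters are not fractional.

```python
import math

def rational_rotation(p, q, r):
    """Matrix (24): every cell is a rational function of p, q, r."""
    c = 1.0 + p * p + q * q + r * r
    return [
        [(1 + p * p - q * q - r * r) / c, (2 * p * q - 2 * r) / c, (2 * p * r + 2 * q) / c],
        [(2 * p * q + 2 * r) / c, (1 - p * p + q * q - r * r) / c, (2 * q * r - 2 * p) / c],
        [(2 * p * r - 2 * q) / c, (2 * q * r + 2 * p) / c, (1 - p * p - q * q + r * r) / c],
    ]

def gram_error(m):
    """Largest deviation of m * m' from the identity matrix."""
    worst = 0.0
    for i in range(3):
        for j in range(3):
            dot = sum(m[i][k] * m[j][k] for k in range(3))
            worst = max(worst, abs(dot - (1.0 if i == j else 0.0)))
    return worst

p, q, r = 0.3, -0.5, 2.0                      # r exceeds unity on purpose
M = rational_rotation(p, q, r)
orthogonality_error = gram_error(M)

# Angle of rotation from tan^2(theta/2) = p^2 + q^2 + r^2.
theta = 2.0 * math.atan(math.sqrt(p * p + q * q + r * r))
trace = M[0][0] + M[1][1] + M[2][2]           # equals 1 + 2 cos(theta) for a rotation

# A vector along the axis (p, q, r) is left unchanged by the rotation.
axis_image = [sum(M[i][k] * v for k, v in enumerate((p, q, r))) for i in range(3)]
```

The trace agrees with $1 + 2\cos\theta$, and the vector (p, q, r) is carried
into itself, in keeping with the statement that the direction cosines of the
axis are proportional to p, q, r.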



* "On the Invariance of General Intelligence," Proc. Nat. Acad. Sci., XIX, August,
1933, p. 771, n. 5. 
Rotation in four dimensions

The procedure of writing any orthogonal transformation which has been
described can be generalized to any number of dimensions. It will be
extended here to four dimensions. Each independent rotation may be regarded
as a disturbance of a pair of columns in the factorial matrix. Each of these
independent rotations is determined by one of the independent x-parameters
and its dependent y-parameter. The number of independent x-parameters
required to determine a rotation in space of r dimensions is equal to
the number of possible pairs of columns that may be taken in the $n \times r$
matrix F. This is $\tfrac{1}{2}r(r-1)$, and consequently we should expect to have six
independent parameters for a rotation in four dimensions.

If the four columns of F are numbered, then the six parameters may be
associated with pairs of columns in F. These may be taken in the following
arbitrary order: 1-2, 1-3, 1-4, 2-3, 2-4, 3-4. Let the corresponding
independent parameters be $x_1, x_2, x_3, x_4, x_5, x_6$, and the corresponding dependent
parameters $y_1, y_2, y_3, y_4, y_5, y_6$. These are related to the independent
parameters as follows:

$$x_1 = \sin\alpha_1, \qquad y_1 = \cos\alpha_1 = \sqrt{1 - x_1^2},$$

with analogous interpretation for each of the other five subscripts. Each
pair of columns in F is represented by an orthogonal transformation. The
matrix product of these six transformations is the matrix of the orthogonal
transformation in four dimensions. The six independent rotations are as
follows:



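$$B_1 = \begin{pmatrix} y_1 & -x_1 & 0 & 0\\ x_1 & y_1 & 0 & 0\\ 0 & 0 & 1 & 0\\ 0 & 0 & 0 & 1 \end{pmatrix}, \qquad
B_2 = \begin{pmatrix} y_2 & 0 & -x_2 & 0\\ 0 & 1 & 0 & 0\\ x_2 & 0 & y_2 & 0\\ 0 & 0 & 0 & 1 \end{pmatrix},$$

$$B_3 = \begin{pmatrix} y_3 & 0 & 0 & -x_3\\ 0 & 1 & 0 & 0\\ 0 & 0 & 1 & 0\\ x_3 & 0 & 0 & y_3 \end{pmatrix}, \qquad
B_4 = \begin{pmatrix} 1 & 0 & 0 & 0\\ 0 & y_4 & -x_4 & 0\\ 0 & x_4 & y_4 & 0\\ 0 & 0 & 0 & 1 \end{pmatrix},$$

$$B_5 = \begin{pmatrix} 1 & 0 & 0 & 0\\ 0 & y_5 & 0 & -x_5\\ 0 & 0 & 1 & 0\\ 0 & x_5 & 0 & y_5 \end{pmatrix}, \qquad
B_6 = \begin{pmatrix} 1 & 0 & 0 & 0\\ 0 & 1 & 0 & 0\\ 0 & 0 & y_6 & -x_6\\ 0 & 0 & x_6 & y_6 \end{pmatrix}.$$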



Let the given co-ordinates be a, and let the final transformed co-ordinates
be A. The points a are to be subjected to six independent and successive
rotations which bring them to the co-ordinates A. Let the six independent
rotations be represented as follows:

$$\begin{aligned}
b &= aB_1, & c &= bB_2, & d &= cB_3,\\
e &= dB_4, & f &= eB_5, & A &= fB_6.
\end{aligned} \tag{26}$$

Combining these six independent rotations, we get the single rotation

$$A = aB_1B_2B_3B_4B_5B_6. \tag{27}$$

Let

$$v = B_1B_2 \cdots B_6.$$

Then

$$A = av, \tag{28}$$

in which v is an orthogonal matrix of order 4 with six independent
parameters and six dependent parameters.

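After performing the matrix multiplication of (28) we have the following
expressions for the cell entries of v:

$$\begin{aligned}
v_{11} &= y_1y_2y_3,\\
v_{21} &= y_2y_3x_1,\\
v_{31} &= y_3x_2,\\
v_{41} &= x_3,\\
v_{12} &= -y_4y_5x_1 - y_1y_5x_2x_4 - y_1y_2x_3x_5,\\
v_{22} &= +y_1y_4y_5 - y_5x_1x_2x_4 - y_2x_1x_3x_5,\\
v_{32} &= +y_2y_5x_4 - x_2x_3x_5,\\
v_{42} &= y_3x_5,\\
v_{13} &= y_6x_1x_4 - y_1y_4y_6x_2 + y_4x_1x_5x_6 + y_1x_2x_4x_5x_6 - y_1y_2y_5x_3x_6,\\
v_{23} &= -y_1y_6x_4 - y_4y_6x_1x_2 - y_1y_4x_5x_6 + x_1x_2x_4x_5x_6 - y_2y_5x_1x_3x_6,\\
v_{33} &= y_2y_4y_6 - y_2x_4x_5x_6 - y_5x_2x_3x_6,\\
v_{43} &= y_3y_5x_6,\\
v_{14} &= -x_1x_4x_6 + y_1y_4x_2x_6 + y_4y_6x_1x_5 + y_1y_6x_2x_4x_5 - y_1y_2y_5y_6x_3,\\
v_{24} &= +y_1x_4x_6 + y_4x_1x_2x_6 - y_1y_4y_6x_5 + y_6x_1x_2x_4x_5 - y_2y_5y_6x_1x_3,\\
v_{34} &= -y_2y_4x_6 - y_2y_6x_4x_5 - y_5y_6x_2x_3,\\
v_{44} &= y_3y_5y_6.
\end{aligned}$$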
If the terms in second and higher degree in the independent x-parameters
are ignored, the matrix v reduces to

$$\begin{pmatrix}
1 & -x_1 & -x_2 & -x_3\\
x_1 & 1 & -x_4 & -x_5\\
x_2 & x_4 & 1 & -x_6\\
x_3 & x_5 & x_6 & 1
\end{pmatrix}. \tag{29}$$

It will be seen that (29), apart from its unit diagonal, is again a
skew-symmetric matrix. It reduces to the identity matrix when the six
rotational angles vanish.

The method which has just been described can be generalized to hyper- 
space. A rotation in five dimensions requires ten independent parameters. 
Six dimensions require fifteen independent parameters. If the number of 
primary factors is fairly large, it seems evident that the direct application 
of an orthogonal transformation to the factorial matrix F in the search for 
the primary factors is prohibitive in computational labor. The use of an 
orthogonal transformation on F presupposes the serious restriction that the 
primary factors are statistically independent in the experimental popula- 
tion. Since this is a condition that cannot be assumed in most factor prob- 
lems, the rotational transformations must be subject to the same limitation. 
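
The composition of plane rotations described above can be sketched numerically. The following is a minimal illustration only, not part of the original computing routine: the function names and the six angles are hypothetical, and the rotation planes are taken in the order (1,2), (1,3), (1,4), (2,3), (2,4), (3,4), which is an assumption consistent with the first column of v given in the text. The helper that lists the planes also reproduces the count of independent parameters, n(n-1)/2.

```python
import math

def rotation_planes(n):
    """All pairs of axes (i, j), i < j; there are n(n-1)/2 of them."""
    return [(i, j) for i in range(n) for j in range(i + 1, n)]

def plane_rotation(n, i, j, angle):
    """Identity matrix of order n with a rotation through `angle` in the (i, j) plane."""
    x, y = math.sin(angle), math.cos(angle)
    g = [[1.0 if r == c else 0.0 for c in range(n)] for r in range(n)]
    g[i][i], g[i][j] = y, -x
    g[j][i], g[j][j] = x, y
    return g

def matmul(a, b):
    """Plain matrix product of two lists of lists."""
    return [[sum(a[r][k] * b[k][c] for k in range(len(b)))
             for c in range(len(b[0]))] for r in range(len(a))]

def compose_rotations(n, angles):
    """Product of the successive plane rotations, one angle per plane."""
    planes = rotation_planes(n)
    if len(angles) != len(planes):
        raise ValueError("need n(n-1)/2 angles")
    v = [[1.0 if r == c else 0.0 for c in range(n)] for r in range(n)]
    for (i, j), angle in zip(planes, angles):
        v = matmul(v, plane_rotation(n, i, j, angle))
    return v

# Hypothetical angles (radians), chosen only for illustration.
angles = [0.10, -0.20, 0.30, 0.05, -0.15, 0.25]
v = compose_rotations(4, angles)
x = [math.sin(a) for a in angles]
y = [math.cos(a) for a in angles]
vt = [list(col) for col in zip(*v)]
vtv = matmul(vt, v)  # should be the identity if v is orthogonal
```

With small angles the off-diagonal cells of `v` approach the first-order form of (29), and `len(rotation_planes(5))` and `len(rotation_planes(6))` give the ten and fifteen parameters mentioned for five and six dimensions.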



CHAPTER X 
THE APPRAISAL OF ABILITIES 

The regression x on s

The principal problem to which the previous chapters have been directed 
is that of isolating and identifying primary factors in a battery of traits. 
The psychological application of factor theory which is of most general cur- 
rent interest is the isolation of primary abilities. The present chapter is 
directed to the problem of appraising the several primary traits in each in- 
dividual. The methods to be described are applicable not only to the psy- 
chological problem of describing the mental and physical traits of individu- 
als, including native as well as acquired traits, but also to any situation in 
which it is desired to describe the individual members of a statistical group 
as regards the traits that may have been found to be primary. 

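Each individual member of the statistical population is described in terms
of r abilities. Let the standard score of individual i in the primary ability
p be denoted x_{pi}. It is desired to estimate x_{pi} in terms of the n tests
which individual i has taken. The standard score of individual i on a test j
has been denoted s_{ji}.

The regression x_{pi} on s_{ji} is as follows:

(1)    x_{pi} = \sum_{j=1}^{n} w_{pj} s_{ji} + \rho_{pi} ,

where the subscript p refers to primary abilities, j refers to the tests, i
refers to the individuals, w_{pj} is the weight of the score s_{ji} in test j in
the appraisal of the primary ability p, and \rho_{pi} is the residual or
discrepancy between the true value of x_{pi} and the best value which can be
obtained as a linear function of the test scores s_{ji}. Expressing this
equation explicitly for the residual,

(2)    \rho_{pi} = x_{pi} - \sum_{j=1}^{n} w_{pj} s_{ji} .

It is desired to determine the values of w_{pj} which will minimize \rho_{pi}
in the least-squares sense. Squaring (2),

(3)    \rho_{pi}^2 = x_{pi}^2 - 2 x_{pi} \sum_{j=1}^{n} w_{pj} s_{ji} + \sum_{j=1}^{n} \sum_{k=1}^{n} w_{pj} w_{pk} s_{ji} s_{ki} .

Summing for the population, and dividing by N,

(4)    \frac{1}{N} \sum_{i=1}^{N} \rho_{pi}^2 = \frac{1}{N} \sum_{i=1}^{N} x_{pi}^2 - \frac{2}{N} \sum_{i=1}^{N} x_{pi} \sum_{j=1}^{n} w_{pj} s_{ji} + \frac{1}{N} \sum_{i=1}^{N} \sum_{j=1}^{n} \sum_{k=1}^{n} w_{pj} w_{pk} s_{ji} s_{ki} = u_p ,

where u_p is the quantity to be minimized.
But

(5)    \frac{1}{N} \sum_{i=1}^{N} x_{pi}^2 = 1 ,

since x_{pi} is a standard score. Substituting (5) in (4) and rearranging,

(6)    1 - 2 \sum_{j=1}^{n} w_{pj} \frac{1}{N} \sum_{i=1}^{N} x_{pi} s_{ji} + \sum_{j=1}^{n} \sum_{k=1}^{n} w_{pj} w_{pk} \frac{1}{N} \sum_{i=1}^{N} s_{ji} s_{ki} = u_p .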

The summation

(7)    \frac{1}{N} \sum_{i=1}^{N} x_{pi} s_{ji} = r_{jp} ,

where r_{jp} is the correlation between the test j and the primary ability p.
It is the scalar product of the test vector j and the primary vector T_p. It
is here assumed that the primary abilities may be correlated in the
experimental population N.
The summation

(8)    \frac{1}{N} \sum_{i=1}^{N} s_{ji} s_{ki} = R_{jk} ,

where R_{jk} is the correlation between tests j and k. The correlation R_{jk}
in (8) is equal to r_{jk} when j \neq k, but it is unity when j = k.

Substituting (7) and (8) in (6),

(9)    1 - 2 \sum_{j=1}^{n} w_{pj} r_{jp} + \sum_{j=1}^{n} \sum_{k=1}^{n} w_{pj} w_{pk} R_{jk} = u_p .

The normal equations for determining w_{pj}, or w_{pk}, are in the form

(10)    \frac{\partial u_p}{\partial w_{pj}} = 0 .

Taking partial derivatives in (9),

(11)    \frac{\partial u_p}{\partial w_{pj}} = -2 r_{jp} + 2 \sum_{k=1}^{n} w_{pk} R_{kj} .

Setting the partial derivatives equal to zero, dividing the equation by 2,
and transposing,

(12)    \sum_{k=1}^{n} w_{pk} R_{kj} = r_{jp} = r_{pj} .

Equation (12) represents a matrix multiplication which may be written
in matrix notation,

(13)    w_{pk} R_{kj} = R_{pj} ,

where w_{pk} is a matrix of order r × n and R_{kj} is a matrix of order n × n.
The latter is of rank n because the diagonal elements are unity, and hence
specific factors and error factors are involved. The matrix R_{pj} is of order
r × n. Since R_{kj} is non-singular, the equation (13) may be written
explicitly for w_{pk}. Then

(14)    w_{pk} = R_{pj} R_{kj}^{-1} = R_{pj} R_{jk}^{-1} .

Writing (14) in transposed form, and using w_{pk} = w_{pj},

(15)    w_{jp} = R_{jk}^{-1} R_{kp} ,

by which the numerical values of the weights w_{pj} in (1) can be determined.
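
The normal equations (12) can be illustrated numerically. The sketch below is an assumption-laden example rather than the author's computing routine: the two-test correlation matrix R and the correlations r_p of the tests with one primary ability are invented values, and the solver is ordinary Gaussian elimination with partial pivoting written out so that the block is self-contained.

```python
def solve_linear_system(a, b):
    """Solve a w = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("matrix is singular")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    w = [0.0] * n
    for i in range(n - 1, -1, -1):
        rest = sum(m[i][c] * w[c] for c in range(i + 1, n))
        w[i] = (m[i][n] - rest) / m[i][i]
    return w

def mean_squared_residual(w, r_p, big_r):
    """u_p of equation (9): 1 - 2 sum(w r) + sum sum (w w R)."""
    n = len(w)
    linear = sum(w[j] * r_p[j] for j in range(n))
    quadratic = sum(w[j] * w[k] * big_r[j][k] for j in range(n) for k in range(n))
    return 1.0 - 2.0 * linear + quadratic

# Hypothetical data: two tests correlating .5 with each other and
# .6 and .5 with the primary ability p.
R = [[1.0, 0.5],
     [0.5, 1.0]]
r_p = [0.6, 0.5]

w_p = solve_linear_system(R, r_p)          # weights w_pj of equation (1)
u_p = mean_squared_residual(w_p, r_p, R)   # minimized value of u_p
```

Because R is symmetric, solving R w = r_p row by row for each primary ability p is equivalent to forming the product in (15).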

The regression s on x

This regression implies that the primary abilities of an individual are
known and that it is desired to estimate what his performance will be on a
test with known factorial weightings. This is the reverse of the previous
regression x on s in which it is assumed that an individual's scores are known
and that his primary abilities are to be appraised.

The case in which s_{ji} is to be estimated by x_{pi} can be written in the form

(16)    s_{ji} = \sum_{p=1}^{r} w_{jp} x_{pi} + \rho_{ji} ,

where w_{jp} is the weight of the score x_{pi} in the estimate of the score
s_{ji}, and \rho_{ji} is the discrepancy between the actual score s_{ji} and the
estimated score in test j. It should be noted that w_{jp} in (16) is not the
transpose of w_{pj} in (1), since these are coefficients in two different
regressions. Writing (16) explicitly for \rho_{ji},

(17)    \rho_{ji} = s_{ji} - \sum_{p=1}^{r} w_{jp} x_{pi} .

Squaring (17),

(18)    \rho_{ji}^2 = s_{ji}^2 - 2 s_{ji} \sum_{p=1}^{r} w_{jp} x_{pi} + \sum_{p=1}^{r} \sum_{q=1}^{r} w_{jp} w_{jq} x_{pi} x_{qi} .



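Summing for the population and dividing by N,

(19)    \frac{1}{N} \sum_{i=1}^{N} \rho_{ji}^2 = \frac{1}{N} \sum_{i=1}^{N} s_{ji}^2 - \frac{2}{N} \sum_{i=1}^{N} s_{ji} \sum_{p=1}^{r} w_{jp} x_{pi} + \frac{1}{N} \sum_{i=1}^{N} \sum_{p=1}^{r} \sum_{q=1}^{r} w_{jp} w_{jq} x_{pi} x_{qi} = u_j ,

where u_j is the quantity to be minimized.
But, by definition,

(20)    \frac{1}{N} \sum_{i=1}^{N} s_{ji}^2 = 1 .

Substituting (20) in (19), and rearranging,

(21)    1 - 2 \sum_{p=1}^{r} w_{jp} \frac{1}{N} \sum_{i=1}^{N} s_{ji} x_{pi} + \sum_{p=1}^{r} \sum_{q=1}^{r} w_{jp} w_{jq} \frac{1}{N} \sum_{i=1}^{N} x_{pi} x_{qi} = u_j .

The summation

(22)    \frac{1}{N} \sum_{i=1}^{N} s_{ji} x_{pi} = r_{jp} ,

and the summation

(23)    \frac{1}{N} \sum_{i=1}^{N} x_{pi} x_{qi} = R_{pq} ,

where R_{pq} is the correlation between the primary abilities p and q. It can
also be regarded as the cosine of the angular separation between the two
primary unit vectors T_p and T_q.

Substituting (22) and (23) in (21),

(24)    1 - 2 \sum_{p=1}^{r} w_{jp} r_{jp} + \sum_{p=1}^{r} \sum_{q=1}^{r} w_{jp} w_{jq} R_{pq} = u_j .

The normal equations are in the form

(25)    \frac{\partial u_j}{\partial w_{jp}} = 0 .

Taking partial derivatives in (24),

(26)    \frac{\partial u_j}{\partial w_{jp}} = -2 r_{jp} + 2 \sum_{q=1}^{r} w_{jq} R_{qp} .

Setting the partial derivatives equal to zero, dividing the equation by 2,
and transposing,

(27)    \sum_{q=1}^{r} w_{jq} R_{qp} = r_{jp} .

Writing equation (27) in matrix notation,

(28)    w_{jq} R_{qp} = R_{jp} .

Since the primary abilities are linearly independent, it follows that the
rank of R_{qp} is r. Hence R_{qp} is non-singular. Equation (28) may therefore
be written explicitly for w_{jp},

(29)    w_{jp} = R_{jq} R_{qp}^{-1} ,

by which the weights w_{jp} in the regression equation (16) may be computed.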
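
For the case of two primary abilities, equation (29) can be written out in closed form. The following sketch is illustrative only: the function name, the test correlations .7 and .4, and the correlation .3 between the primaries are all hypothetical. It shows that the weights coincide with the correlations r_{jp} when the primaries are uncorrelated and depart from them otherwise.

```python
def weights_s_on_x(r_j1, r_j2, c):
    """Weights w_j1, w_j2 solving equations (27) for two primary abilities.

    r_j1, r_j2 : correlations of test j with primaries 1 and 2
    c          : correlation between the two primaries (|c| < 1)
    """
    det = 1.0 - c * c  # determinant of [[1, c], [c, 1]]
    if det <= 0.0:
        raise ValueError("primaries must be linearly independent")
    w_j1 = (r_j1 - c * r_j2) / det
    w_j2 = (r_j2 - c * r_j1) / det
    return w_j1, w_j2

# Hypothetical values for one test j.
orthogonal = weights_s_on_x(0.7, 0.4, 0.0)  # uncorrelated primaries
oblique = weights_s_on_x(0.7, 0.4, 0.3)     # correlated primaries
```

In the uncorrelated case the function returns the correlations unchanged, since the matrix of correlations between the primaries is then the identity.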

It is of interest to note the form which equation (29) takes in the special
case where the primary abilities are orthogonal. Then

(30)    R_{qp}^{-1} = I , the identity matrix,

and

R_{jp} = F_{jp} ,

where F_{jp} is a factorial matrix with orthogonal primary reference vectors,
so that

(31)    w_{jp} = F_{jp} .

Substituting (31) in the regression equation (16), we have

(32)    s_{ji} = F_{jp} x_{pi} + \rho_{ji} .

In this equation \rho_{ji} is that part of the score s_{ji} which is not produced
by the primary common factors. Hence \rho_{ji} is produced by specific and
error factors. In the simplest case where all of the contributing factors are
common, we have

(33)    s_{ji} = F_{jp} x_{pi} ,

which is the first equation of chapter i, as was to be expected.

OUTLINE OF CALCULATIONS FOR THE CENTROID
METHOD WITH UNKNOWN DIAGONALS

An n × r matrix F may be obtained from a given correlational matrix R
with unknown diagonals by the following calculations. This is the method
described in Example 6, chapter iii, except that the reflection of traits will
not always be carried to the point where all column sums are positive when
the diagonals are ignored. Only those traits will be reflected which minimize
the number of negative signs in each column of R, as described in Example
5, chapter iii.

The method will be described in relation to the computations on a 9 X 9 
table of experimental correlations given by Professor Carl C. Brigham in 
his 1928 annual report to the College Entrance Examination Board. The 
data represent nine intelligence tests used by the College Entrance Board. 
The correlations are based on the records of 4,175 boys. 

The calculations are recorded on data sheets* devised for twenty variables
or less. In working with more than twenty variables, the correlation
table may be divided into 20 × 20 sections with a data sheet for each section.
The notation Σ_0, B, D, E, and K on the data sheet is the same as that used
in the tables of chapter iii.

Steps in calculation 

1. Record the table of intercorrelations as shown in Table 1. This may
be any n × n correlational matrix R with elements r_{jk} which satisfies the
inequality (5-ii). In this example R_0 is given, n = 9, and the inequality is
satisfied if the number of factors turns out to be 5 or less.

2. Record the signs of these correlations as indicated in the upper part
of the narrow cells provided for the signs. This corresponds to the "first
position" of the signs described in Example 5 of chapter iii.

3. The diagonal cells of this table are blank, since the communalities are
unknown. The cells for the entries r_{11}, r_{22}, . . . , r_{99} are the diagonal cells.

4. Since all of the entries of this table are positive, it is not necessary to 
reflect any of the tests. When any column of R has a majority of negative 
coefficients, traits are reflected at this stage of the procedure by the method 
described for Table 2. 

* These data sheets are available at the University of Chicago Bookstore, Chicago,
Illinois.

5. Decide upon the estimate of the communalities to be used. A small
number of variables demands a more accurate estimation of the communalities.
When n is large, Method 4 of chapter ii is recommended. This
method will be used here.

6. Pick the highest coefficient in each column, disregarding sign, and record
it in the diagonal with positive sign placed in the upper half of the narrow
sign cell.

EXAMPLE: The highest correlation in column 1 is .625. It is recorded in the
diagonal cell of that column as +.625. If the highest coefficient in this
column had been -.625, it would still have been recorded in the diagonal
as +.625.

7. Add the entries in each column and record the sums in row D at the
bottom of the data sheet. These are the sums \sum_{j=1}^{n} r_{jk} = r_k for each
column k of equation (12-iii).

EXAMPLES: The sum of the nine entries in column 1 is 5.022. This is recorded
in row D, column 1. It is the sum \sum_{j=1}^{9} r_{j1} = r_1.

The sum of the nine entries in column 2 is 4.213. This is recorded in
row D, column 2. It is the sum \sum_{j=1}^{9} r_{j2} = r_2.

8. Add the entries in each row of Table 1 and record the sums in column
D at the extreme right of the data sheet. These are the sums
\sum_{k=1}^{n} r_{jk} = r_j for each row j. These sums should agree with their
corresponding column sums recorded in row D at the bottom of the data sheet.

9. Add all the column sums in row D. Record this value, 42.072, in
row D, column D. This is the sum \sum_{k=1}^{n} r_k = \sum_{j=1}^{n} \sum_{k=1}^{n} r_{jk} = r_t
of equation (8-iii).

10. Add all the row sums of column D. This gives \sum_{j=1}^{n} r_j = r_t = 42.072.
Check to see that this sum agrees with the sum obtained in step 9.

11. Determine \sqrt{r_t}. In this example, \sqrt{r_t} = \sqrt{42.072} = 6.486293.
Record this value in the space below r_t.

12. Compute the reciprocal, 1/\sqrt{r_t}. For these data, the value is
1/6.486293 = .154171. Record .154171 in the space below \sqrt{r_t}.

13. Multiply each sum in row D by the value 1/\sqrt{r_t} = .154171 obtained in
step 12 and record the results in row E at the bottom of the data sheet.
Each value in row E takes the sign of its corresponding sum in row D.
These are the first-factor loadings, a'_{k1}, with the signs of the variables as
used to obtain the sums in D.

EXAMPLES: Test 1: a'_{11} = .154171(+5.022) = +.774 .
Test 2: a'_{21} = .154171(+4.213) = +.650 .
. . . .
Test 9: a'_{91} = .154171(+5.009) = +.772 .

14. The product r_t (1/\sqrt{r_t}) should give \sqrt{r_t} recorded on the data sheet if
the arithmetical work in determining the multiplier has been correct. In
this example, 42.072(.154171) = 6.486282, which checks with the recorded
value of \sqrt{r_t} = 6.486293 to the fourth decimal place.

15. If the loadings in row E represent a centroid system, then ΣE, the
sum of all the entries in row E, should equal \sqrt{r_t}. Record ΣE in the space
ΣE in the lower right corner of the data sheet.

EXAMPLE: ΣE = 6.486 .
\sqrt{r_t} = 6.486 .

16. Copy the values of row E in row K with the sign reversed for each 
test which has been reflected an odd number of times. Any test reflected 
an odd number of times will have the last recorded sign negative before its 
variable number. Since no tests were reflected in this table, the values in 
rows E and K are the same, all of the first-factor loadings are positive, and 
ΣK = ΣE = 6.486.

17. Record the values of row K in the first column of Table 7. Table 7 
will be the nXr matrix F when r factors have been extracted. 
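
Steps 6 to 15 can be summarized in a short sketch. This is an illustration under stated assumptions, not the data-sheet routine itself: the 3 × 3 table `r` is a hypothetical set of correlations (Table 1 is not reproduced here), the function names are invented, and no reflection of tests is performed. Only the last three lines use figures quoted in the steps above, namely the column sums 5.022, 4.213, and 5.009 and the total 42.072.

```python
import math

def estimate_diagonals(r):
    """Step 6: put the largest absolute off-diagonal entry of each column
    in the diagonal cell, with positive sign."""
    n = len(r)
    filled = [row[:] for row in r]
    for k in range(n):
        filled[k][k] = max(abs(r[j][k]) for j in range(n) if j != k)
    return filled

def first_centroid_loadings(r):
    """Steps 7-13: column sums divided by the square root of the grand total."""
    n = len(r)
    column_sums = [sum(r[j][k] for j in range(n)) for k in range(n)]
    total = sum(column_sums)          # r_t
    multiplier = 1.0 / math.sqrt(total)
    return [s * multiplier for s in column_sums], total

# Hypothetical correlations with blank (zero) diagonal cells.
r = [[0.0, 0.6, 0.5],
     [0.6, 0.0, 0.4],
     [0.5, 0.4, 0.0]]
loadings, r_t = first_centroid_loadings(estimate_diagonals(r))

# Step 15 check: the loadings should sum to the square root of r_t.
check = sum(loadings) - math.sqrt(r_t)

# Figures quoted in the steps above for tests 1, 2, and 9.
quoted = [round(s / math.sqrt(42.072), 3) for s in (5.022, 4.213, 5.009)]
```

The list `quoted` reproduces the loadings +.774, +.650, and +.772 given in the examples of step 13.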

18. Take a new data sheet and label it "First-Factor Residual Coefficients:
r_{2.jk}" as shown in Table 2.

19. Insert the variable numbers with signs as given in Table 1, in the 
second row and second column provided for them in Table 2. Place the 
signs in the upper half of the narrow sign column. In this example, all of 
these signs are positive. 

In data where traits are reflected in the first table, the signs transferred
to Table 2 are those of the traits on Table 1 after reflection.

20. Copy the first-factor loadings from row E of Table 1 in the first row
and in the first column labeled E' in Table 2. This arrangement facilitates
residual computations on a calculating machine.

EXAMPLES: The first-factor loading in test 1 from row E of Table 1 is
+.774. It is recorded in Table 2 in the space in front of variable 1 in
column E' and in the space above variable 1 in row E'.

Similarly, the first-factor loading in test 2 from row E of Table 1 is
+.650. It is recorded in Table 2 in the space in front of variable 2 in
column E' and in the space above variable 2 in row E'.

21. Check this transfer by adding the loadings for the nine rows of
column E'. This gives ΣE' = \sum_{j=1}^{9} a'_{j1} = 6.486, which is the value of ΣE
on Table 1. Record this sum in the space marked ΣE' at the left of the data
sheet. Add the loadings for the nine columns of row E'. This gives
ΣE' = \sum_{k=1}^{9} a'_{k1} = 6.486, which is the value of ΣE on Table 1. Record this
value of ΣE' in the space provided in the upper right corner of the data sheet.

22. Compute the first-factor residuals by formulae of the type (14-iii),

r_{2.jk} = r_{jk} - a'_{j1} a'_{k1} ,

and record in the jth row and kth column of Table 2.
23. For column 1 of Table 2, these residuals are

r_{2.j1} = r_{j1} - a'_{j1} a'_{11} ,

where k = 1 and j takes values from 1 to 9. The value r_{j1} is the entry in the
jth row and first column of Table 1; a'_{j1} is the first-factor loading for test j
recorded in row j of column E' in Table 2 and a'_{11} is the first-factor loading
in test 1 recorded at the top of column 1 in row E' of Table 2.

EXAMPLES:

r_{2.11} = +.625 - .774(.774) = (+.026) . Record above double line in column 1.
r_{2.21} = +.482 - .650(.774) = -.021 . Record in row 2, column 1 .
r_{2.31} = +.617 - .731(.774) = +.051 . Record in row 3, column 1 .
r_{2.41} = +.518 - .665(.774) = +.003 . Record in row 4, column 1 .
r_{2.51} = +.625 - .804(.774) = +.003 . Record in row 5, column 1 .
r_{2.61} = +.422 - .593(.774) = -.037 . Record in row 6, column 1 .
r_{2.71} = +.584 - .738(.774) = +.013 . Record in row 7, column 1 .
r_{2.81} = +.563 - .759(.774) = -.024 . Record in row 8, column 1 .
r_{2.91} = +.586 - .772(.774) = -.012 . Record in row 9, column 1 .

The sign of each residual is recorded in the upper half of its narrow sign cell. 
The diagonal for this column and for all succeeding columns is recorded in 
the space just above the double line on the data sheet. This leaves the diag- 
onal cell vacant in each column. 

24. Add the entries in column 1, including the diagonal, and record in
column 1 for the row marked "Actual Σ_0" at the bottom of the data sheet.
This sum should be zero or nearly zero. It is +.002.

25. The expected value of this sum, designated "Check Σ_0" on the data
sheet, may be calculated for each column k by the formula,

r_k - a'_{k1} ΣE' ,

where r_k is the sum in row D, column k of Table 1, a'_{k1} is the first-factor
loading at the top of column k in Table 2, and ΣE' has the value already
recorded on Table 2.

EXAMPLE: For column 1, this check is

r_1 - a'_{11} ΣE' = 5.022 - .774(6.486) = +.002 .

This agrees with the "Actual Σ_0" value for column 1 indicated in step 24.

The "Actual Σ_0" and "Check Σ_0" values are not always exactly the same
as in this case, but their difference very seldom exceeds .003 when three
decimals are used in the calculations.

26. Since the residual tables are all symmetric about the diagonal, the
calculated entries in column 1 may be copied in their corresponding cells
in row 1, i.e.,

r_{2.21} = r_{2.12} = -.021 . Record in row 1, column 2 .
r_{2.31} = r_{2.13} = +.051 . Record in row 1, column 3 .
r_{2.41} = r_{2.14} = +.003 . Record in row 1, column 4 .
r_{2.51} = r_{2.15} = +.003 . Record in row 1, column 5 .
r_{2.61} = r_{2.16} = -.037 . Record in row 1, column 6 .
r_{2.71} = r_{2.17} = +.013 . Record in row 1, column 7 .
r_{2.81} = r_{2.18} = -.024 . Record in row 1, column 8 .
r_{2.91} = r_{2.19} = -.012 . Record in row 1, column 9 .

When this step is completed, all cells in row 1 and column 1 of Table 2 are 
filled except the diagonal. 

27. Add all the entries in row 1, including the diagonal, and record the
sum in the column labeled "Actual Σ_0" at the right of the data sheet. This
sum should agree with the sum in step 24. It is +.002. This check is
valuable in working with a large number of variables; it is not necessary when
n is twenty or less.

28. Calculate the residuals in the diagonal and below it for each column k
of Table 2 in the manner described in steps 22 and 23.

EXAMPLES:

Column 2: r_{2.j2} = r_{j2} - a'_{j1} a'_{21} .

r_{2.22} = +.592 - .650(.650) = (+.170) .
r_{2.32} = +.397 - .731(.650) = -.078 .
r_{2.42} = +.397 - .665(.650) = -.035 .
. . . .

Column 3: r_{2.j3} = r_{j3} - a'_{j1} a'_{31} .

r_{2.33} = +.626 - .731(.731) = (+.092) .
r_{2.43} = +.472 - .665(.731) = -.014 .
r_{2.53} = +.626 - .804(.731) = +.038 .

29. As soon as the residuals for column k are computed below the diag- 
onal, fill in the entries in the row for that test above the diagonal by sym- 
metry, as described in step 26. 

EXAMPLES:

Row 2: r_{2.32} = r_{2.23} = -.078 . Record in row 2, column 3 .
r_{2.42} = r_{2.24} = -.035 . Record in row 2, column 4 .
. . . .
r_{2.92} = r_{2.29} = -.062 . Record in row 2, column 9 .

Row 3: r_{2.43} = r_{2.34} = -.014 . Record in row 3, column 4 .
r_{2.53} = r_{2.35} = +.038 . Record in row 3, column 5 .
. . . .
r_{2.93} = r_{2.39} = -.007 . Record in row 3, column 9 .

30. Check the accuracy of the residual calculations for each column k by
the methods described in steps 24 and 25; make this check for each row j by
the method of step 27.

EXAMPLES:

Test 2: r_2 - a'_{21} ΣE' = 4.213 - .650(6.486) = -.003 .
Actual Σ_0 = -.002 .

Test 3: r_3 - a'_{31} ΣE' = 4.744 - .731(6.486) = +.003 .
Actual Σ_0 = +.004 .

n tests: r_t - \sum_{j=1}^{n} a'_{j1} \sum_{k=1}^{n} a'_{k1} = 42.072 - 6.486(6.486) = +.004 .
Actual Σ_0 = +.006 .

31. Table 2 should now have every entry filled except the diagonals, and
all of the sums, Σ_0, should be recorded.
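
The residual computation of steps 22 to 25 can be checked mechanically. The sketch below uses only figures quoted in the steps above (the nine entries of column 1 of Table 1, including the estimated diagonal .625, and the nine first-factor loadings); the function name is hypothetical, and rounding to three decimals is assumed to mimic the hand calculation.

```python
# First-factor loadings a'_{j1} for tests 1 to 9, as quoted in the examples.
loadings = [0.774, 0.650, 0.731, 0.665, 0.804, 0.593, 0.738, 0.759, 0.772]

# Column 1 of Table 1, with the estimated diagonal .625 in the first cell.
column_1 = [0.625, 0.482, 0.617, 0.518, 0.625, 0.422, 0.584, 0.563, 0.586]

def residual_column(column, loadings, k):
    """Formula (14-iii): r_{2.jk} = r_{jk} - a'_{j1} a'_{k1} for a fixed column k."""
    return [r_jk - a_j * loadings[k] for r_jk, a_j in zip(column, loadings)]

# Residuals for column 1, rounded to three decimals as on the data sheet.
residuals = [round(v, 3) for v in residual_column(column_1, loadings, 0)]

# Step 24: the actual sum of the recorded residuals.
actual_sum = round(sum(residuals), 3)

# Step 25: the check value r_k - a'_{k1} * (sum of loadings).
check_sum = round(sum(column_1) - loadings[0] * sum(loadings), 3)
```

Both `actual_sum` and `check_sum` come out at +.002, in agreement with the examples of steps 24 and 25.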

32. Pick the highest coefficient in each column, disregarding sign, and 
record it in the diagonal with positive sign. This sign should be in the up- 
per half of the narrow sign cell. 

33. Prepare a table similar to Table 3 with variable numbers 1 to n at the
top of the columns. Add a check column and one labeled "k_i." This table
will be used to minimize the number of negative signs in each column of
Table 2 in order to determine the tests to be reflected.

34. Count the number of negative signs in each column of Table 2 and
record in row 1 and in the proper column of Table 3. These are the values,
N_j, described in Example 5, chapter iii.

EXAMPLES: The number of negative signs in column 1 of Table 2 is four;
hence 4 is the entry in the first row and first column of Table 3.

Similarly, there are six negative signs in column 2 of Table 2;
consequently the entry in column 2 of the first row of Table 3 is 6.

35. Check these values by counting the number of positive signs in each
column excluding the diagonal. The sum of the positive and negative signs
in each column must be (n - 1).

36. Add all the entries in the first row of Table 3 and record in the check
column. This sum is 46.

37. Pick the test having the highest number of negative signs to be
reflected first. Tests 2, 6, and 9 have a maximum of six negative signs. The
choice of one of these is arbitrary; test 2 is chosen here.

38. Record the variable number 2 in the column headed k_i and put an
"X" above column 2 in Table 3 to indicate that this test is to be reflected.
An adjustment in the number of negative signs for each column will be made
as if test 2 were reflected in Table 2; these results will be recorded in row 2
of Table 3.

39. For the trait being reflected, i.e., test 2 in this case, the entry in its
column of row 2, Table 3, will be (n - 1) minus the number of negative signs
for its column in row 1 of Table 3. The value of (n - 1) is the total number
of entries in each column of Table 2, ignoring the diagonals.

In this example, (n - 1) = 8, and the entry in column 2, row 2, of Table 3
becomes 8 - 6 = 2.

40. Proceed to that row of Table 2 for the test being reflected, i.e., row 2, 
and consider the sign of each entry there except the diagonal. 

a) If that entry for a given trait not previously reflected is positive, increase 
by one the number of negative signs for that column recorded in row 1 of 
Table 3, and record the new value in its proper column of row 2, Table 3. 

EXAMPLES: The entry for test 6 in row 2 of Table 2 is positive. Test 6 has 
not been previously reflected. The number of negative signs for test 6 re- 
corded in row 1 of Table 3 is six. Consequently, 6 is increased one, giving 
7 as the entry in row 2, column 6, of Table 3. 

The entry for test 8 is also positive in row 2 of Table 2. Test 8 has not 
been previously reflected. In the same manner, its number of negative 
signs is increased one, giving 6 as the new value in row 2, column 8, of 
Table 3. 
b) If the entry for a given test not previously reflected is negative, decrease
the number of negative signs for that column by one and record the new
value in its proper column of row 2, Table 3.

EXAMPLES: The entry for test 1 in row 2 of Table 2 is negative. Test 1 has 
not been previously reflected. The number of negative signs for test 1 
recorded in row 1 of Table 3 is four. Hence, 4 is decreased one, giving 3 
as the entry for test 1 in row 2 of Table 3. 

The entries for tests 3, 4, 5, 7, and 9 are all negative in row 2 of Table 2. 
Since none of these tests has been previously reflected, the number of 
negative signs for each of them will be reduced one. This gives the entries 
4, 4, 4, 3, and 5 for these respective tests in row 2 of Table 3. 

All of the cells in row 2 of Table 3 should now be filled. 

41. Add all of the entries in row 2 of Table 3. This sum is 38.

42. If the sum of all the entries in row 1 of Table 3 (from step 36) minus
the sum of all the entries in row 2 of Table 3 (from step 41) is twice the
difference between the number of negative signs for the reflected trait 2 in
these two rows of Table 3, the arithmetical work in deriving row 2 of Table 3
is checked.

EXAMPLE: 46 - 38 = 2(6 - 2) .
8 = 8 .

43. Pick the test having the highest number of negative signs in row 2 
of Table 3 as the next test to be reflected. This is test 6 with a maximum of 
seven negative signs. 

44. Record the variable number 6 in column k_i of Table 3 and put an
"X" above column 6 to indicate that this test is to be reflected.

45. The entry in column 6, row 3, of Table 3 will be (n - 1) minus the
entry for test 6 in row 2 of Table 3, i.e., 8 - 7 = 1.

46. Proceed to row 6 of Table 2 and consider the sign of each entry there 
except the diagonal in order to adjust the number of negative signs of row 2, 
Table 3, as if test 6 were reflected. These new values will become row 3 of 
Table 3. 

a) For the tests which have not been previously reflected the same rules of
adjustment of number of negative signs apply as in steps 40a and 40b,
except that the adjustment is made with reference to row 2 instead of row 1
of Table 3.

EXAMPLES: The entry for trait 4 is positive in row 6 of Table 2. Trait 4 has
not been previously reflected. Trait 4 has four negative signs in row 2 of
Table 3. Hence, it will have 4 + 1 = 5 negative signs recorded in row 3 of
Table 3.

The entries for tests 1, 3, 5, 7, 8, and 9 are negative in row 6 of Table 2. 
None of these tests has been previously reflected. Hence, the entries for 
these respective columns in row 2 of Table 3 are each reduced one. This 
gives 2, 3, 3, 2, 5, and 4 as the entries for columns 1, 3, 5, 7, 8, and 9, 
respectively, in row 3 of Table 3. 

b) When an entry in row 6, Table 2, is positive and the test has been
previously reflected in this table, decrease by one the number of negative signs
for that trait as recorded in row 2 of Table 3, and record the new value in its
column of row 3, Table 3.

EXAMPLE: The entry for test 2 is positive in row 6, Table 2. Test 2 was
previously reflected. The number of negative signs for test 2 in row 2 of
Table 3 is two. Hence, the entry for test 2 in row 3 of Table 3 becomes
2 - 1 = 1.

This gives the nine entries in row 3 of Table 3.

47. Add the entries in row 3 of Table 3. This sum is 26.

48. If the sum of the entries in row 2 of Table 3 (in step 41) minus the 
sum of the entries in row 3 of Table 3 (in step 47) is twice the difference be- 
tween the entries in these rows for the test being reflected, i.e., test 6 in this 
case, the arithmetical work in deriving row 3 of Table 3 is checked. 

EXAMPLE: 38 - 26 = 2(7-1) . 
12 = 12 . 

49. Pick the trait having the highest number of negative signs in row 3 
of Table 3 as the next one to be reflected. 

The maximum number of negative signs in row 3 of Table 3 is five for 
tests 4 and 8. Either one may be reflected; test 4 is arbitrarily chosen here. 

50. Write the variable number 4 in column k_i of row 3, Table 3, and put
an "X" above column 4 to indicate that test 4 is to be reflected.

51. The entry for trait 4 in row 4 of Table 3 will be (n - 1) minus the
number of negative signs for trait 4 in row 3 of Table 3.

EXAMPLE: 8 - 5 = 3.

52. Proceed to row 4 of Table 2 and consider the sign of each entry there 
except the diagonal, in order to adjust the number of negative signs of row 3, 
Table 3, as if test 4 were reflected. These new values will become row 4 of 
Table 3. 

a) For the tests which have not been previously reflected, the same rules
of adjustment of number of negative signs apply as in steps 40a and 40b,
except that the adjustment is made with reference to row 3 instead of row 1
of Table 3.

EXAMPLES: The entries for the unreflected tests 1 and 7 in row 4 of Table 2
are positive. Hence, their entries of row 3, Table 3, are each increased one
and recorded in row 4 of Table 3. This gives the entry 3 for each of the
columns 1 and 7 in row 4 of Table 3.

The entries for the unreflected tests 3, 5, 8, and 9 are negative in row 4 of 
Table 2. Hence, their entries in row 3 of Table 3 are each reduced one, 
giving the entries 2, 2, 4, and 3, respectively, for these variables in row 4 
of Table 3. 

b) The entry for the previously reflected test 6 is positive in row 4 of
Table 2. By the method of step 46b, its value in row 4 of Table 3 becomes one
less than its value in row 3 of Table 3.

EXAMPLE: 1 - 1 = 0 = entry for test 6 in row 4 of Table 3.

c) When the entry for a previously reflected test is negative in row 4 of
Table 2, increase by one the number of negative signs for that test as recorded
in row 3 of Table 3 and record the new value in its column of row 4,
Table 3.

EXAMPLE: The entry for the previously reflected test 2 is negative in row 4 of
Table 2. The number of negative signs for test 2 in row 3 of Table 3 is one.
This value is increased one to give 2 as the entry for test 2 in row 4 of
Table 3.

This gives the nine entries in row 4 of Table 3.

53. Add all the entries in row 4 of Table 3 and record in the check
column. This sum is 22.

54. If the sum of all the entries of row 3, Table 3 (from step 47), minus 
the sum of all the entries for row 4, Table 3 (from step 53), is twice the dif- 
ference between the entries in these rows for the test being reflected, i.e., 
test 4, the arithmetical work in deriving row 4 is checked. 

EXAMPLE: 26 - 22 = 2(5-3) . 
4 = 4. 

55. Zero entries sometimes appear in residual tables, such as Table 2, 
They are treated as of positive sign in making sign adjustments for the reflec- 
tion of tests. 

56. It sometimes happens that a test already reflected in a table may 
appear a second time (or any even number of times) as the test having the 
maximum number of negative signs. In this case, it is reflected back to its 
original position in the configuration by reversing each of the rules enu- 
merated in steps 40a, 406, 466, and 52c. 

EXAMPLE: For purposes of illustration only, reflect test 2 a second time 
as if it had had a maximum number of negative signs in row 4 of Table 3. 
The number of negative signs for each test would then be those of Row 5 
in the following table: 
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Again, for purposes of illustration, reflect test 6 a second time as if it had 
had a maximum number of negative signs in row 5 above. The sign adjust- 
ment recorded in row 6 is obtained in the same manner as for test 2 recorded 
in row 5, except to note that for the entry in row 6, column 2, of Table 2, the 
rule must be adjusted to take account of the fact that test 2 is now in its 
unreflected form, since it has been previously reflected twice. The sign of the 
entry in row 6, column 2, of Table 2 is positive; hence rule 40a is the one to 
be reversed. This gives the entry 5 in row 6, column 2, of the foregoing 
table. 

In case a test is reflected a third (or any odd number of times), the rules, 
40a, 406, 466, and 52c apply directly by considering each test previously re- 
flected an even number of times as an unreflected test, and each test previously 
reflected an odd number of times as a test previously reflected once. 

EXAMPLE: Row 7 of the foregoing table gives the adjustment in number of 
negative signs of each column as though test 2 were reflected a third time. 
The rules apply directly except for the entry in column 6; test 6 has been 
previously reflected twice, so that it is considered as an unreflected test hav- 
ing a positive sign in row 2, Table 2; rule 40a then applies directly, and 
the entry for test 6 in row 7 becomes 8. 

When test 6 is then reflected a third time, the rules apply directly,
except for the entry for test 2, which has been previously reflected three
times. Test 2 is then considered as a test previously reflected once, and
rule 46b applies. The results are shown in row 8 of the foregoing sign
table.
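
The parity rule stated above can be checked mechanically. The following is a minimal Python sketch, not part of the original procedure: the four-test residual table `R`, the helper names `reflect` and `negative_counts`, and the use of NumPy are all assumptions made for illustration. It reflects one test repeatedly and recounts the negative signs directly from the table, instead of applying rules 40a through 52c, which are not reproduced in this section.

```python
import numpy as np

def reflect(matrix, test):
    """Reverse the signs of one test's row and column.

    The diagonal entry is reversed twice and is therefore unchanged.
    """
    out = matrix.copy()
    out[test, :] *= -1
    out[:, test] *= -1
    return out

def negative_counts(matrix):
    """Count the negative signs in each column, ignoring the diagonal."""
    off_diagonal = matrix - np.diag(np.diag(matrix))
    return (off_diagonal < 0).sum(axis=0)

# Hypothetical symmetric residual table for four tests (illustration only).
R = np.array([
    [ 0.20, -0.10,  0.05, -0.08],
    [-0.10,  0.30, -0.12,  0.07],
    [ 0.05, -0.12,  0.25, -0.06],
    [-0.08,  0.07, -0.06,  0.15],
])

once = reflect(R, 1)        # second test reflected once
twice = reflect(once, 1)    # reflected a second time
thrice = reflect(twice, 1)  # reflected a third time

print(negative_counts(R), negative_counts(once))
print(np.array_equal(twice, R), np.array_equal(thrice, once))
```

In this sketch a test reflected twice is indistinguishable from an unreflected test, and a test reflected three times from one reflected once, which is the bookkeeping convention the rules rely on.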

The cases discussed in steps 55 and 56 do not occur in our present calcu- 
lations. 

57. All entries in row 4 of Table 3 are now equal to or less than $(n-1)/2$.
Test 8 has four negative signs, which just balances the number of positive
signs, ignoring the diagonal. All the other tests have a majority of positive
signs. The tests listed in column k of Table 3 are now ready to be reflected
in Table 2.

58. Reverse the signs of the tests indicated in column k of Table 3 in
their rows of Table 2. Record these signs in the lower half of the narrow
sign cell for each entry involved.

EXAMPLE: Reverse all the signs of entries in rows 2, 4, and 6 of Table 2.
The entries in row 2 then have the signs +, -, +, +, +, -, +, -, +
in the lower half of the narrow sign cell. This corresponds to the "second
position" of the signs described in Example 5, chapter iii.

59. Indicate that the signs in rows 2, 4, and 6 have been changed by re- 
versing the signs before the variable numbers 2, 4, and 6 in the second col- 
umn of Table 2. Record these new signs in the lower half of the narrow 
sign cell. 

60. Reverse all of the signs in the columns for the tests being reflected 
and record these signs in the residual cells in front of each residual involved. 
Where two signs appear in the narrow sign cell for any entry, it is the sign 
in the "second position" that is reversed. 

EXAMPLE: Reverse the signs of all entries in columns 2, 4, and 6. The en-
tries in column 2 then become +, +, +, -, +, +, +, -, +.
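
The row and column reversals of steps 58 and 60 can be traced on the signs alone. The sketch below is illustrative only and rests on two stated assumptions: the table is symmetric, so that column 2 starts with the same signs as row 2, and the signs of row 2 before reflection (`row2_before`) are inferred by undoing the reversal described in the example for step 58. The names `as_signs`, `row2_before`, and `s` are hypothetical.

```python
import numpy as np

def as_signs(values):
    """Render a vector of +1/-1 values as a string of + and - signs."""
    return " ".join("+" if v > 0 else "-" for v in values)

# Assumption: signs of row 2 of the residual table before any reflection,
# inferred by undoing the reversal quoted in the example for step 58.
row2_before = np.array([-1, +1, -1, -1, -1, +1, -1, +1, -1])

# -1 marks a test to be reflected (tests 2, 4, and 6); +1 an unreflected test.
s = np.ones(9, dtype=int)
s[[1, 3, 5]] = -1

# Step 58: reversing row 2 changes every sign in that row.
row2_after_rows = s[1] * row2_before

# By symmetry column 2 starts with the same signs as row 2. The row
# reversals of step 58 change only its entries in rows 2, 4, and 6.
col2_after_rows = s * row2_before

# Step 60: reversing column 2 changes every sign in that column.
col2_after_cols = s[1] * col2_after_rows

print(as_signs(row2_after_rows))
print(as_signs(col2_after_cols))
```

The two printed lines correspond to the sign sequences quoted in the examples for steps 58 and 60; the diagonal entry of test 2 is reversed twice and so ends with its original sign.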

61. Indicate that the signs in columns 2, 4, and 6 have been changed by 
reversing the signs before the variable numbers 2, 4, and 6 in the second row 
of Table 2. Record the new sign in the lower half of the narrow sign cell. 
These signs, after reflection for these variables, are the ones which will be 
transferred to the next residual table, i.e., Table 4. 

62. Copy the last recorded sign for each entry in the columns represent- 
ing the unreflected tests. Place these signs in the residual cells in front of 
each coefficient. 

EXAMPLE: These are columns 1, 3, 5, 7, 8, and 9 in these data. By this pro-
cedure the signs of the residuals of column 1 are +, +, +, -, +, +, +, -, -,
as indicated in Table 2.

Test j      I       II      III

  1       .774    .070    .094
  2       .650   -.367   -.218
  3       .731    .167    .184
  4       .665   -.118    .261
  5       .804    .306   -.011
  6       .593   -.485   -.071
  7       .738    .126    .167
  8       .759    .039   -.163
  9       .772    .291   -.255



63. Determine the sum of each column of Table 2, ignoring the diagonals,
and record in row B at the bottom of the data sheet.

EXAMPLES: Test 1 = +.086.
          Test 2 = +.516.
          . . . . .
          Test 9 = +.430.

This step is useful only if it is desired to reflect tests until a maximum 
positive sum is secured. Steps 63 and 64 may be combined in cases where it 
is satisfactory to minimize the number of negative signs in each column of 
R without demanding a maximum positive sum. 

64. Add the diagonal value for each column to the sum for that column 
in row E. These are the sums +.137, +.723, . . . , +.572 in row D at the 
bottom of Table 2. 

65. Add all the entries in row D. This sum is 3.873. 

66. Add all of the entries in each row of Table 2, including the diagonal, 
and record in column D at the extreme right of the data sheet. The row 
and column sums D for the same test should agree. 

67. Add all the entries in column D. This sum is 3.873, which agrees
with the sum in step 65. This is the value $r_t$ of equation (19-iii).

68. The multiplier $1/\sqrt{r_t}$, the reciprocal of the square root of the sum
3.873 found in step 67, is obtained in the same manner as described in
steps 11 and 12 for Table 1. Its value is .508132.

69. Multiply each sum in row D by .508132 and record in row E. These
are the second-factor loadings, $a_{j2}$, of equation (19-iii). The signs of these
loadings are those of the reflected variables.




EXAMPLES: $a_{12}$ = .508132(+.137) = +.070.
          $a_{22}$ = .508132(+.723) = +.367.
          . . . . .
          $a_{92}$ = .508132(+.572) = +.291.

70. Check: Add all the entries in row E and compare with $\sqrt{r_t}$.
EXAMPLE: $\Sigma E$ = 1.969.
         $\sqrt{r_t}$ = 1.968.

71. Copy the values of row E in row K, with sign reversed for each test 
which has been reflected an odd number of times; i.e., for each test which 
has the last recorded sign negative before its variable number. 

EXAMPLE: Tests 2, 4, and 6 have the last recorded sign negative before 
their variable numbers, since each test has been reflected once. Conse- 
quently, their loadings in row K take signs opposite to those in row E. 

72. The sum of the loadings in row K should be approximately zero if a 
centroid system has been obtained. This sum for Table 2 is +.029. 

73. Copy the values of row K as the second column of Table 7. These are 
the second-factor loadings of the unreflected tests. 
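
Steps 63 through 73 can be sketched in a few lines. This is a hedged illustration rather than a transcription of the data sheet: the function name `centroid_loadings`, the three-test table built from made-up loadings, and the use of NumPy are assumptions. The only figures taken from the text are the sum 3.873, the multiplier .508132, and the second-factor loadings of the nine tests.

```python
import numpy as np

def centroid_loadings(table, reflected):
    """Loadings of one centroid factor from a reflected residual table.

    table     : symmetric table after reflection, diagonal entries included
    reflected : True for each test reflected an odd number of times
    """
    row_d = table.sum(axis=0)                   # steps 63-64: column sums with diagonals
    r_t = row_d.sum()                           # steps 65-67: sum of all entries
    multiplier = 1.0 / np.sqrt(r_t)             # step 68
    row_e = multiplier * row_d                  # step 69: loadings of reflected tests
    row_k = np.where(reflected, -row_e, row_e)  # step 71: restore original signs
    return row_e, row_k, r_t

# Hypothetical rank-one table built from made-up loadings (illustration only).
a = np.array([0.8, 0.6, 0.5])
table = np.outer(a, a)
row_e, row_k, r_t = centroid_loadings(table, np.array([False, True, False]))

# Step 70 check: the loadings in row E should add up to the square root of r_t.
check = row_e.sum() - np.sqrt(r_t)

# Figures quoted in the text for the nine-test example.
multiplier_in_text = 1.0 / np.sqrt(3.873)
row_k_in_text = np.array([.070, -.367, .167, -.118, .306, -.485, .126, .039, .291])
sum_e_in_text = np.abs(row_k_in_text).sum()   # step 70
sum_k_in_text = row_k_in_text.sum()           # step 72
```

For a table of rank one the sketch returns the generating loadings exactly, which is a convenient way to test an implementation before applying it to real residuals.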

74. Take a new data sheet and label it "Second-Factor Residual Coeffi-
cients: $r_{3.jk}$" as shown in Table 4.

75. Proceed, as in steps 18 through 73, to determine the second-factor
residual coefficients, $r_{3.jk}$, and the third-factor loadings. These calculations
are shown in Table 4.

Tests 4, 8, and 9 were reflected in Table 4. The sign table is shown in
Table 5.
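
The passage from one table to its residual table can be sketched as follows. Steps 18 and following are not reproduced in this section, so the sketch assumes the usual definition of a residual coefficient, the entry less the product of the two loadings, $r_{jk} - a_j a_k$. The function names and the three-test table are hypothetical.

```python
import numpy as np

def centroid_factor(table):
    """Loadings of one centroid factor: column sums over the root of the total."""
    column_sums = table.sum(axis=0)
    return column_sums / np.sqrt(column_sums.sum())

def residual_table(table, loadings):
    """Remove one factor from the table: r_jk - a_j * a_k for every cell."""
    return table - np.outer(loadings, loadings)

# Hypothetical table: one strong factor plus small made-up discrepancies.
a = np.array([0.8, 0.6, 0.5])
noise = np.array([
    [ 0.000, 0.010, -0.010],
    [ 0.010, 0.000,  0.005],
    [-0.010, 0.005,  0.000],
])
table = np.outer(a, a) + noise

loadings = centroid_factor(table)
residuals = residual_table(table, loadings)

largest = np.abs(residuals).max()
column_sums = residuals.sum(axis=0)
```

Every column of the residual table sums to zero before reflection, which is why tests must be reflected before the next factor can be extracted. When the largest residual is small enough to ignore, as here, no further factor is determined.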

76. The third-factor residual coefficients, $r_{4.jk}$, shown in Table 6, are suf-
ficiently small to ignore. Consequently a fourth factor was not determined.

77. Table 7 shows the projections of the nine tests of this example on the
three centroid axes obtained. This is the $n \times r$ matrix $F$ of the fundamental
factor theorem $FF' = R$.

A METHOD OF FINDING THE ROOTS OF A POLYNOMIAL
Consider a polynomial of the type (14-iv),

$$c_0\beta^r + c_1\beta^{r-1} + c_2\beta^{r-2} + \cdots + c_{r-1}\beta + c_r = 0 ,$$

where r is a positive integer, $c_0 \neq 0$, and $c_0, c_1, \ldots, c_r$ are real coefficients.
The r roots, $\beta$, of this equation are desired. Determine the upper and lower
limits* of the roots of this equation. Let a trial value of $\beta$ within these
limits be $\beta'$. If $\beta'$ is a root of the polynomial $f(\beta)$, then by the Remainder
Theorem,† $f(\beta') = 0$. The numerical value of $f(\beta')$ may be determined on an
electric calculating machine by computing $f(\beta)/(\beta - \beta')$ by the process of
synthetic division.‡ Consider the sign of the numerical value of $f(\beta')$ and
select a second trial root, designated $\beta''$, which will give $f(\beta'')$ opposite in
sign to that of $f(\beta')$. When two such trial values of $\beta$ are found, there is at
least one root** between them. Determine a third trial value, $\beta'''$, by linear
interpolation between $f(\beta')$ and $f(\beta'')$. If the value $f(\beta''') = 0$, then $\beta'''$ is one
of the r roots of the polynomial. If $f(\beta''') \neq 0$, interpolate for successive trial
values until that value of $\beta$ is found for which the remainder, $f(\beta)$, is zero
to as many decimals as required.

Repeat this process for each of the r roots of the polynomial by taking
trial values in other regions between the upper and lower limits of the roots.
In the method of principal axes of chapter iv, a very useful first approxima-
tion for each root, $\beta_m$, of the characteristic equation is $-\sum_{j=1}^{n} a_{jm}^2$ for each
column m of F, when $c_0, c_1, \ldots, c_r$ are all positive.

Table 2 shows the application of the method of synthetic division in cal-
culating one of the roots of equation (27-iv),

$$\beta^4 + 6.965369\beta^3 + 10.810494\beta^2 + 5.407203\beta + .840052 = 0 .$$

The upper limit of the roots of this equation is any positive number; the
lower limit is -7.965369. Since the equation for this example is from the
method of principal axes, the first trial value $\beta' = -5.011317 = -\sum_{j=1}^{n} a_{jm}^2$
from Table (6-iv). Since $f(\beta')$ is negative, $\beta'' = -5.020000$ was chosen arbi-
trarily to secure a positive value of $f(\beta'')$. Linear interpolation between
$f(\beta')$ and $f(\beta'')$ gave the third trial value, $\beta''' = -5.019710$. Linear in-
terpolation between $f(\beta''')$ and $f(\beta'')$ gave the fourth trial value, $\beta'''' = -5.019712$,
for which the value of the polynomial is +.000018. Hence, one
root of this equation is -5.019712.

* L. E. Dickson, First Course in the Theory of Equations (New York: John Wiley &
Sons, 1922), pp. 21-23.

† Ibid., p. 12.   ‡ Ibid., pp. 13-15.   ** Ibid., p. 67.
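
The worked example can be reproduced with a short program. The sketch below is an illustration under stated assumptions, not the machine routine itself: the function names `synthetic_division`, `interpolate`, and `find_root` are hypothetical, and intermediate values are carried at full floating-point precision instead of being rounded to six decimals, so the remainders may differ from the tabled ones in the fifth decimal.

```python
def synthetic_division(coeffs, beta):
    """Divide the polynomial by (x - beta); the last entry is the remainder f(beta)."""
    row = [coeffs[0]]
    for c in coeffs[1:]:
        row.append(c + beta * row[-1])
    return row

def interpolate(beta_a, f_a, beta_b, f_b):
    """Linear interpolation for the zero lying between two trial values."""
    return beta_a - f_a * (beta_b - beta_a) / (f_b - f_a)

def find_root(coeffs, beta_a, beta_b, tol=1e-9, max_trials=50):
    """Repeat the interpolation until the remainder is smaller than tol.

    The two starting trial values must give remainders of opposite sign.
    """
    f_a = synthetic_division(coeffs, beta_a)[-1]
    f_b = synthetic_division(coeffs, beta_b)[-1]
    beta_c = beta_a
    for _ in range(max_trials):
        beta_c = interpolate(beta_a, f_a, beta_b, f_b)
        f_c = synthetic_division(coeffs, beta_c)[-1]
        if abs(f_c) < tol:
            break
        # Keep the pair of trial values whose remainders differ in sign.
        if (f_c < 0) == (f_a < 0):
            beta_a, f_a = beta_c, f_c
        else:
            beta_b, f_b = beta_c, f_c
    return beta_c

coeffs = [1.0, 6.965369, 10.810494, 5.407203, 0.840052]  # equation (27-iv)
beta_1, beta_2 = -5.011317, -5.020000                    # first two trial values
f_1 = synthetic_division(coeffs, beta_1)[-1]
f_2 = synthetic_division(coeffs, beta_2)[-1]
beta_3 = interpolate(beta_1, f_1, beta_2, f_2)           # third trial value
root = find_root(coeffs, beta_1, beta_2)

print(f_1, f_2, beta_3, root)
```

Each call to `synthetic_division` corresponds to one row of the table: the intermediate entries are the partial results and the last entry is the remainder.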

The values of $f(\beta)$ in each row of Table 2 were determined by the equa-
tions in their corresponding rows of Table 1. In actual application, it is
possible to carry out the r calculations in each row on a calculating machine
without recording any of the values except the rth one in the column
headed $f(\beta)$.

Table 1 



Table 2

Trial   $\beta^4$    $\beta^3$       $\beta^2$        $\beta$        $f(\beta)$     Trial $\beta$

         1.0   +6.965369   +10.810494   +5.407203   +.840052
  1             1.954052     1.018120     .305081   -.688806    -5.011317
  2             1.945369     1.044742     .162598   +.023810    -5.020000
  3             1.945659     1.043850     .167379   -.000142    -5.019710
  4             1.945657     1.043856     .167347   +.000018    -5.019712


APPENDIX III 

A METHOD OF DETERMINING THE SQUARE ROOT ON 
THE CALCULATING MACHINE 

Newton's* iterative method of determining square roots may be used 
very advantageously on the calculating machine, using Barlow's Tables to 
determine the first trial value of the square root. 

Let N be the number whose square root is desired, and let $x_0$ be the first
trial value for $\sqrt{N}$ derived from Barlow's Tables. Determine $N/x_0$ on
the calculating machine, and record it.

Compute a new trial value, $x_1 = .5(x_0 + N/x_0)$, by determining the cumula-
tive sum of the two products, $(.5x_0)$ and $[.5(N/x_0)]$. Determine $N/x_1$. If
the two values $x_1$ and $N/x_1$ are the same to as many decimals as desired,
then $\sqrt{N} = x_1 = N/x_1$. If these two values differ, this process may be re-
peated for as many trials as are necessary to find that trial value x which
agrees with N/x to the required number of decimals.

EXAMPLE
N = 42.072.

$x_0 = \sqrt{42.07}$ in Barlow's Tables = 6.4861391.

$N/x_0$ = 6.48644738.

$x_1 = .5x_0 + .5(N/x_0)$ = 6.48629324.

$N/x_1$ = 6.48629324.
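
The same iteration is easily written as a program. The following minimal sketch uses the figures of the example; the function name `newton_sqrt` and its stopping rule, agreement of x and N/x to a chosen number of decimals, are assumptions that mirror the description above.

```python
def newton_sqrt(n, x0, decimals=8, max_trials=20):
    """Newton's iteration for the square root of n, starting from the trial value x0."""
    x = x0
    for _ in range(max_trials):
        quotient = n / x
        # Stop when the trial value and the quotient agree to the required decimals.
        if round(x, decimals) == round(quotient, decimals):
            break
        x = 0.5 * x + 0.5 * quotient
    return x

N = 42.072
x0 = 6.4861391                   # first trial value, from the tabled root of 42.07
x1 = 0.5 * x0 + 0.5 * (N / x0)   # one step, as in the example
root = newton_sqrt(N, x0)

print(N / x0, x1, N / x1, root)
```

One step from a table value that is good to four figures already gives agreement between x and N/x to about eight decimals, which is why the method suits a calculating machine.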



* E. T. Whittaker and G. Robinson, The Calculus of Observations (London: Blackie & 
Son, 1926), pp. 79-80.

Abilities

common, 54-63, 66-67, 69 

definition of, 48 

distribution of, 49-50 

indices of, 48-49 

linearly independent, 50 

negative, 165-66 

oblique, 50, 52 

orthogonal, 50-51

primary, 51-53, 55, 73, 75, see Primary 
traits 

reference, 46, 48, see Reference ability, 
Reference trait, Reference vector 

in scientific formulation, 45, 50, 54-55

specific, 54-63, 67

statistically independent, 50 

unique, 55, 62-63 

unitary, 51 

Absolute variable error, 49 
Adjoint of a matrix, 9 

Algebraic uniqueness, 73
Analytical method 

corrections for trial vectors, 189-94 

criterion for best-fitting simple struc- 
ture in, 185-89 

numerical example of, 194-97

successive approximation in, 189-97 

theory of, 185-94 

Angular separation of hyperplanes, 160-61 
Array of a matrix, 2 
Assumption 

in index of ability, 48 

of linear contributions of factors, 50, 52 

in test variance, 54, 56 
Augmented co-ordinates, 158 
Augmented correlation coefficient, 118-19 
Averages, method of, 175-77 
Axes 

centroid, see Centroid axes 

co-ordinate, definition of, 157 

principal, 120-33, see Principal axes 

Bounding hyperplane, 200 

Brigham's data 

augmented correlations in, 119
centroid analysis of, 108-18
centroid co-ordinates for, 117
correlations corrected for uniqueness, 119
factorial matrix for, 117
graphical analysis of primary traits in, 167-70
intercorrelations of fifteen tests, 108-9
method of averages for, 176-77
method of oblique axes for, 172-74
primary traits in, 167-70, 172-74, 176-77
principal axes of, 132-33
simple structure in, 167-70, 172-74, 176-77
trait configuration plotted on a sphere, 167-68
Brown and Stephenson data, 142-44 

Calculation, outline for centroid method, 232-50

Case of n tests and n factors, 77-81 
Categories, relation to 

correlational matrix, 153 

factorial matrix, 153 

order, 153 

orthogonal simple structure, 153 

reference vectors, 153 

simple order, 153 

simple structure, 153

structure, 153 

trait configuration, 153 
Cell entries, see Elements 
Centroid, distance from origin, 94 
Centroid axes 

orthogonal, 96 

trait projections on first axis, 94 

trait projections on second axis, 97 

transformation to principal axes from, 

123-24 
Centroid co-ordinates 

for Brigham's tests, 117 

check on calculation of, 97-98 

of n points, 93 

for traits, 94, 97 
Centroid method 

for Brigham's data, 108-18 

communalities in, 98-118 

with communalities in the diagonal 
cells, 101-2 

diagonal entries in, 98-118

with each diagonal entry greater than 
the communality and less than unity, 
102 

with each diagonal entry less than the 
communality, 102-3

fictitious eight-variable problem with 
known communalities, 103-8 

first-factor loading by, 94 

first-factor residuals in, 94-95

imaginary factors by, 102-3 

numerical examples of, 98-118 

principles of, 92-98 

purpose of, 92 

second-factor loading by, 97 

steps in calculation, 232-50 

sum of all coefficients in R, 94 

sum of column of R, 94 

with unity in the diagonal cells, 98-100 

with unknown diagonals, 108-18 
Centroid reference vectors, 151 
Centroid solution, scientific interpretation 

of, 92, 113, 118 
Change of signs in 

column of F, 71 

row of F, 72, 95 

Characteristic determinant, 27-28 
Characteristic equation 

coefficients of, 27-28, 123, 125-26, 132 

definition of, 27-28 

for principal axes, 123, 125-26, 132 

roots of, 123, 126, 133 

zero root of, 126-29 
Characteristic matrix, 27-28 
Check, arithmetical 

on calculation of centroid factor load- 
ings, 97-98 

in sign-reversing method, 107 
Cluster of traits, 96 
Coefficients of characteristic equation, 

27-28, 123 

Cofactor, 7 

Common abilities, 54-63, 66-67, 69

Common factors 

in complete correlational matrix $R_1$, 65-66
dependence upon test battery, 54
in factorial matrix $F_4$, 57, 59
in moment matrix, 64-65 
in population matrix, 56, 58-59 
in reliability coefficient, 67-68 
in score matrix, 60 
in self correlation, 66 

Common-factor space, 69 
Common-factor variance, 54, 62-63, 68 

Communality 

in centroid method, 98-118 
as complement of uniqueness, 63 
definition of, 62 
effect of rotation on, 128 
by expansion of a minor of order (r +1), 
86 



by expansion of principal minors of 
order (n 1), 91 

by expansion of principal minors of 
order (r+a), 91 

geometrical interpretation of, 69 

by grouping of similar tests, 87 

by grouping of three tests, 87-89 

by highest coefficient in each column, 89 

by linear dependence of rows or col- 
umns, 89-90 

for matrix $R_0$, 73

methods of estimating, 85-91 

in reliability coefficient, 67-68 

after rotation, 128 

by sectioning of the matrix, 90-91 
Complete correlational matrix $R_1$

cell entries of, 65-66 

common factors in, 65-66 

definition of, 66 

diagonal entry of, 66 

error factors in, 65-66 

geometrical interpretation of, 69 

relation to moment matrix, 65 

relation to score matrix, 64-65, 70 

specific factors in, 65-66 
Complexity of 

composite traits, 211-12 

traits, 155 
Composite matrix 

definition of, 83 

numerical example of, 83 

theorem, 83-85 
Composite test, 51 
Cone, degenerate, 162-63 
Configuration 

correlational, 151, 153 

trait, 151, 153, 157 

uniqueness of, 74-77 
Constellations, 174-75
Constructs, ideal, 44-48, 52, 54 
Convincingness of 

hypothesis, 45, 73, 78 

primary traits, 161-62 
Co-ordinates 

augmented, 158 

of centroid of n points, 93 

of traits on centroid axes, 94, 97 
Co-ordinate axis, 157 

Co-ordinate hyperplanes 

angular separation of, 160-61 

definition of, 154 

distinctness of, 156 

equation of, 162 

normals to, 157-58 

overdetermination of, 156 

positive, 200 

for the trait configuration, 157 
Coplanar traits, 177-79
Correction for uniqueness

definition of, 118-19 

geometrical interpretation of, 118-19 
Correlation 

experimentally obtained, 72-73 

between oblique reference vectors, 160- 
61 

between primary traits, 160 

between principal axes, 128 

between tests, 64, 66 

between trait j and reference vector A, 
121 

between trait j and single common fac- 
tor, 139-40, 146-47 
Correlation coefficient 

augmented, 118-19 

and complexity of composite traits, 
211-12 

linear expression for, 92 

summational form of, 66 

and underlying rational equation, 206 

unitary elements in, 207-12 
Correlation matrix 

complete, 65-66, 70, see Complete correlational matrix $R_1$

and descriptive categories, 153 

diagonal entries of, 66, 70, 73, 102-3, 129 

element of, 92 

experimentally obtained, 72 

geometrical interpretation of, 69 

intercolumnar correlation in, 134-36

intercolumnar proportionality of, 134-36

maximizing positive sum of coefficients 
in, 98-103, 107-8, 112-14, 117 

minimizing number of negative signs 
in, 103-17 

order of, 65 

properties of, 70 

rank of, 56, 95, 102-3 

of rank one, 134-49

reduced, 66, see Reduced correlational 
matrix R 

relation to moment matrix, 65-66 

requirements for factoring, 92 

sum of all coefficients in, 94 

sum of column of, 94 

unique configuration in, 74-77 

Correlation matrix R, 70-72 

Correlation matrix $R_0$, communalities in, 73

Correlational configuration, 151, 153 
Cosines, see Direction cosines 

Criteria for 

isolation of simple structure, 163, 185-89 

oblique simple structure, 156 

reflection of traits, 96-97 

sign reversals, 96-97 
Criterion for orthogonality of a matrix, 218 



Degenerate cone, 162-63 
Degrees of freedom, 45 
Dependence, linear, 32-33 
Determinant 

characteristic, 27-28 

cofactor, evaluation of, 7-8 

conventional representation, 5 

definition of, 5 

evaluation of, 6-9, 11-13 

first minor of, 7 

inversion of elements, 5 

leading term of, 4 

of a matrix, 3-13 

minors of, 6-7 

order of, 3 

position sign of cell of, 3-4 

principal diagonal of, 3

principal minor of, 6 

of product of two matrices, 20 

properties of, 11-13 

secondary diagonal of, 3 

term of, 4-5 
Diagonal, secondary, 3 
Diagonal entries for 

centroid method, 98-118 

complete correlational matrix $R_1$, 66, 102-3

correlational matrix R, 70

Hotelling's solution, 129-30 

imaginary factors, 102-3 

matrix $R_0$, 73

principal axes method, 129 

reduced correlational matrix R, 66 
Diagonal matrices 

in factorial matrix Ft, 59-60 

in moment matrix, 65 

properties of, 21-23 
Diagonal method 

numerical example of, 81 

theory of, 78-80 
Difference of two matrices, 19 
Direction cosines 

corrections for, in analytical method, 
189-94

corrections for, in single-hyperplane 
method, 18^-85 

of normal to hyperplane, 34-35, 159-60

of oblique reference vectors, 154-55, 
159-60 

positive, 166 

of primary trait vector, 160 

of reference vector, 121-24 
Distance of 

centroid from the origin, 94 

linear space from origin, 34-35
Distinctness of hyperplanes, 156 
Distribution of ability 

Gaussian, 49-50 

not Gaussian, 49-50
Elements

of complete correlational matrix $R_1$, 65-66
of correlational matrix R, 92
of factorial matrix $F_4$, 57, 59
of a matrix, 2
of matrix $R_0$, 72
of moment matrix, 64 
negative in F, 165-66 
of oblique factorial matrix V, 155, 157, 

200 

of population matrix P*, 55-56, 58 
of score matrix, 60 
unitary, 205-12 

Ellipse 

and isolation of primary traits, 178-80 

projection of unit trait vectors into, 

178-80 
Equation 

characteristic, see Characteristic equa- 
tion 

expanded notation for an, 13, 28 

fundamental, 52, 54-60 

graphical representation of an, 14-18 

of a hyperplane, 35, 162 

in \mp and /3 P , 123 

of a line, 33 

matrix notation for an, 15-19 

normal form of, 34-35

of an oblique simple structure, 162-63 

of a plane, 35 

rational, 45, 206 

rectangular notation for an, 14-15, 17-18

summational notation for an, 15-16, 

28-31 

Error, absolute variable, 49 
Error, sampling, 72 
Error factors in 

complete correlational matrix $R_1$, 65-66

factorial matrix $F_4$, 57, 59

moment matrix, 64-65 

population matrix, 56, 58-59 

score matrix, 60 

self correlation, 66 
Error variance, 54, 62-63, 67-68 
Eulerian angles, 214 
Expanded notation for equations, 13, 28 
Experimentally independent, 50, 76 

Facilitating tables for single-hyperplane 
method, 185-86 

Factors 

common, see Common factors 
error, see Error factors 
imaginary, 102-3 
linear combination of, 52, 206 
linearly independent, 72 
primary, see Primary traits 
specific, see Specific factors 



total number of, 65 
unitary, 207-12 

in variance of a test, 54-55, 62-63, 68 
Factor analysis 

conditions for unique solution, 71, 74-77 

and distribution of ability, 49-50 

as first approximation, 206 

object of, 53, 63, 70, 157 

raw data of, 54 
Factor loading 

check on calculation of, 97-98 

elimination of negative, 201-5 

of first centroid factor, 94 

in fundamental equation, 52 

invariant, 55, 120 

negative for tests, 120-21 

notation for, 57 

of second centroid factor, 97 

of test j with single common factor, 

139-40, 146-47 

Factor methods, applicability of, 48 
Factor problem, 150, 163 
Factor theorem, the fundamental, 70 
Factorial description, fundamental cri- 
terion for, 55, 120 
Factorial matrix 

for Brigham's data, 117 

and descriptive categories, 153 

maximizing number of zeros in, 179-85 

negative cell entries in, 165-66 

for parallel tests, 67 

relation to reduced correlational matrix, 
70 

restrictions on, 199 

and rotation of axes, 150 

scientific interpretation of, 165, 199-201 

sign reversal in, 71-72, 95 

zeros in row of, 150-52 
Factorial matrix JP, 66 
Factorial matrix F 

algebraic uniqueness of, 73 

configurational uniqueness of, 74-77

definition of, 70 

number of parameters in, 75-76 

rank of, 72 

unique solution, 71, 74-77 
Factorial matrix $F_1$, 59-60
Factorial matrix $F_4$

columns linearly independent, 57 

common factors in, 57, 59 

components FI, A, A, 59-60 

definition of, 60 

elements of, 57, 59 

error factors in, 57, 59 

geometrical interpretation of, 69 

interpretation of cells of, 57 

relation to score matrix, 54-60

row of, 57 

specific factors in, 57, 59
Factorial matrix, oblique

elements of, 155, 200 

notation for, 154 
Factorial solution 

for any symmetric matrix, 78-81 

for matrix $R_0$, 73

psychologically meaningful, 74-77 

for rank n, 77-81 
Faculty psychology, 53 
First approximation, 47-48, 50, 206 
First-factor loading, centroid, 94 
First-factor residuals 

in centroid method, 94-95 

rank of table of, 95 
Function for 

maximizing number of zero entries in 
V, 181-82 

maximizing number of zero factor 

loadings, 181-82 
Fundamental 

criterion, 55, 120 

equation, 52, 54-60 

factor theorem, 70 

postulate, 50 

Gaussian distribution, 49-50 
Geometrical interpretation of 

change of signs in F, 71-72 

communality, 69

complete correlational matrix $R_1$, 69

correction for uniqueness, 118-19

elements of V, 155, 200

factorial matrix $F_4$, 69

image of trait, 95 

matrices, 36-37 

moment matrix, 69 

population matrix, 68-69 

positive values of $a_{jm}$ in F, 166

primary traits, 157-61 

reduced correlational matrix R, 69

reference trait, 121 

reflection of traits, 95 

residuals, 95-96 

score matrix, 69 

simple structure, 157-61 

tests, 69 

zero entries in V, 180-81 

zero factor loadings, 180-81 
Gramian matrix, 10 
Gramian properties 

and diagonal of R, 70 

and imaginary factors, 103

in principal axes solution, 129 

and rank of R, 91 
Graphical analysis of tetrads, 141-44 
Graphical method for 

isolating primary traits, 164-65, 167-70

orthogonal positive manifold, 201 

rank one, 136-37 



Graphical representation of equations, 14- 
18 

Hotelling's solution, 129-32 
Hyperellipsoid, projection of unit trait 

vectors into, 178-79 
Hyperplanes 

angular separation of, 160-61 

bounding, 200 

definition of, 154 

distinctness of, 156 

equation of, 35, 162 

normals to, 157-60 

overdetermination of, 156 

positive, 166, 202-5 

positive co-ordinate, 200 

for the trait configuration, 157 
Hypothesis, convincingness of, 45, 73, 78 

Ideal constructs, 44-48, 52, 54
Identity matrix, 22 
Image of a trait 

definition of, 95 

geometrical interpretation of, 95 
Imaginary factors, 102-3 
Imaginary pure trait, 121 
Independent 

experimentally, 50, 76 

linearly, 32, 50, 76 

statistically, 50, 76
Index of ability, 48-49 
Inequality (5-ii), 76 
Infinitesimal orthogonal transformations, 

219 

Infinitesimal rotations, 219 
Intercolumnar criterion 

correlational, 134-36 

proportionality, 134-36 

Spearman's use of, 134-36
Intercorrelations 

experimentally obtained, 72-73 

of oblique reference vectors, 160-61 

positive for tests, 166 

of primary traits, 160 

of principal axes, 128 

of tests, 64, 66 

of traits, 64, 66

of traits and reference vectors, 121

true, 65-66, 72 
Interpretation of 

cell entries of F, 155, 200 

centroid factorial matrix, 92, 113, 118 

factorial matrix, 165, 199-201

oblique factorial matrix, 155 

Invariance of 

communalities under rotation, 128 
factor loadings, 55, 120 
test coefficients, 55, 120 
weights, 55, 120
Inverse

evaluation of, 26

of a matrix, 23-26 

of product of matrices, 25 

Inversion of elements of a determinant, 5 

Isolation of simple structure 
analytical method for, 185-97 
comparison of methods of, 197-98 
constellations in, 174-75 
criteria for, 163, 185-89 
fundamental criterion for, 55, 120 
graphical method for, 164-65, 167-70 
iterative procedures for, 183-85, 189-97 
method of averages for, 175-77 
by method of maximizing the number of 

zero factor loadings, 179-85 
methods of successive approximation 

for, 183-85, 189-97 
oblique axes method for, 171-74 
for rank two, 177-79 
single-hyperplane method for, 179-85 
summary of concepts in, 153 

Isolation of unique abilities, 63-64 

Iterative procedures, 183-85, 189-97 

Kronecker's delta, 39 

Law, scientific, 44-45, 47 

Leading term of a determinant, 4 

Line, equation of, 33-34

Linear combination of factors, 52, 206 

Linear dependence, 32-33, 89-90 

Linear forms as first approximation, 47- 

48, 50, 206 

Linear space, distance from origin, 34-35 
Linear transformation, 38 
Linearly independent 

columns of $F_4$, 57

definition of, 32, 50, 76 

factors, number of, 72 
Loadings 

in centroid method, 94, 97

elimination of negative, 201-5 

in fundamental equation, 52 

invariant, 55, 120 

negative for tests, 120-21 

of test j with single common factor, 
139-40, 146-47 

Major principal axis, 124, 126-27 
Manifold, oblique positive 

definition of, 200 

methods of determining, 201-2 
Manifold, orthogonal positive 

definition of, 200 

graphical solution for, 201 
Manifold, positive, 199-212 



Matrix 

adjoint of, 9 

array of, 2 

cell of, 2 

characteristic, 27-28 

column of, 2 

column vector, 14 

of common factors, 66 

complete correlational, 65-66, 69-70 

composite, 83-85 

conventional representation, 2 

criterion of orthogonality of, 218 

definition of, 1 

determinant of, 3-13 

determinant of product of two, 20 

diagonal, 21-23, 65 

difference of two, 19 

element of, 2 

of experimental correlations, 72 

factorial, 54-60, 70-74 

formulation of single-factor methods, 



geometrical interpretation of, 36-37 

Gramian, 10 

identity, 22 

interpretation of tetrads, 137 

inverse of, 23-26 

multiplication of, 14-20 

associative, 19 

cross products in, 14 

distributive, 20 

non-commutative, 18 

postmultiplication, 19-20 

premultiplication, 19-20 
non-singular, 11 
notation for equations, 15-19
order of, 1-2 

orthogonal by columns, 41 
orthogonal by rows, 41, 61, 65 
for population, 55-56, 58-59 
positive-definite, 10 
postmultiplication of, 19-20 
premultiplication of, 19-20 
rank of, 10, 33, 81-85 
reciprocal of, 25 
reduced correlational, 66 
row of, 2 
row vector, 14 
scalar, 21 

sectioning of, 83-85 
singular, 11 
skew symmetric, 10 
of standard scores, 54-60
sum of two, 19 
symmetric, 10 
of a transformation, 38 
transpose of, 3 
transpose of product of, 16 
Matrix A, 59-60 
Matrix $D_2$, 59-60
Matrix F, 66
Matrix F, 70
Matrix $F_1$, 59-60
Matrix $F_4$, 54-60
Matrix FL, 152, 154
Matrix H, 160
Matrix $P_1$, 58-59
Matrix $P_2$, 58-59
Matrix $P_3$, 58-59
Matrix R 

definition of, 66 

diagonal entry of, 66 

rank of, 56, 95, 102-3 
Matrix R, 70-72 
Matrix $R_1$, 65, 66
Matrix $R_0$

communalities in, 73

definition of, 72 

diagonal entries of, 73 

elements of, 72 

factorial solution of, 73 

number of experimentally independent 
values in, 75 

rank of, 72-73 

relation to reduced correlational matrix, 
73 

sampling errors in, 72 

sources of unique variance for, 78 

Matrix T, 160 

Matrix V, 154-55

Maximizing number of zero entries in V, 179-85

Maximum positive sum in R, 98-103, 107- 
8, 112-14, 117 

Mean principal axis, 124, 127 

Method of averages 

for Brigham's data, 176-77 

for isolating primary traits, 175-77 

numerical example of, 176-77 

Method of oblique axes, 171-74 

Method of principal components, 129-32 

Minimum rank of $R_0$, 73

Minors 

for estimation of communalities, 86, 91 

in evaluation of characteristic equation, 
123 

first, 7 

principal, 6 

for rank one, 134, 137-39 
Minor principal axis, 124, 127 

Moment matrix 

common factors in, 64-65 
diagonal matrices in, 65 
elements of, 64 
error factors in, 64-65
geometrical interpretation of, 69 



in matrix notation, 64-65 

relation to correlational matrix, 65-66 

relation to score matrix, 64 

specific factors in, 64-65
Multiple-factor problem, statement of, 150, 163
Multiplication of matrices, 14-20 

Negative abilities, 165-66 
Negative cell entries in F, 165-66
Negative factor loadings 

elimination of, 201-5 

in test performance, 120-21 
Negative signs in R, minimising of, 103-17 
Non-singular matrix, 11 
Normal distribution of ability, 49-50 
Normal form of equation of 

hyperplane, 35 

line, 34 

plane, 35 
Normal to a hyperplane 

direction cosines of, 34-35, 159-60

relation to primary trait vectors, 157-58

and zeros in V, 181
Normalized standard score, 49 
Notation, summational, 15-16, 28-31 
Notation summary, 55, 57, 68 
Number of 

factors which n variables will deter- 
mine, 76-77 

tetrads for n tests, 138-39

variables to determine r factors, 76-77 
Numerical examples of 

analytical method, 194-97 

centroid method, 98-118 

composite matrix, 83 

diagonal method, 81 

estimation of communality, 86-87, 90-91

method of averages, 176-77 

method of oblique axes, 172-74 

method of principal axes, 124-29, 132- 
33 

projection of unit trait vectors into an 
ellipse, 179-80 

sign-reversing methods, 98-118 

single-factor method without tetrads, 
147-49 

Oblique abilities, 50, 52 
Oblique-axes method 

for Brigham's data, 172-74

numerical example of, 172-74 

theory of, 171-72 
Oblique factorial matrix V 

elements of, 155, 200 

interpretation of, 155 

notation for, 154
Oblique positive manifold

definition of, 200 

methods of determining, 201-2 
Oblique reference vectors 

and columns of G, 154-55

correlation between, 160-61 

direction cosines of, 154-55, 159-60 

relation to primary trait vector, 159-60 
Oblique simple structure 

criteria for, 156 

definition of, 154 

degenerate cone in, 162-63 

equation of, 162-63 
Oblique transformation, 43 
Oblique transformation G, 154-55, 159-60, 185 
Order 

concepts of, 153 

of correlational matrix, 65 

and descriptive categories, 153 

of a determinant, 3 

of a matrix, 1-2 

orthogonal simple, 153 

of score matrix, 54 

simple, 150, 152-53 
Orthogonality of a matrix, 218 
Orthogonal abilities, 50-51 
Orthogonal axes in 

centroid method, 96 

principal axes method, 120, 123-24 
Orthogonal matrix 

by columns, 41 

by rows, 41 

Orthogonal matrix F, 70 
Orthogonal matrix FL, 152, 154 
Orthogonal positive manifold 

definition of, 200 

graphical solution for, 201 
Orthogonal reference vectors, 151 
Orthogonal simple order, 153 
Orthogonal simple structure, 151, 153 
Orthogonal transformations, 213-25 

from centroid axes to principal axes, 
123-24, 127, 132 

explanation of, 38-43 

for four dimensions, 224-25 

for infinitesimal rotation in three di- 
mensions, 219 

requirements of, 215 

for three dimensions, 218-21 
Orthogonal transformation L, 152, 154 

Parallel test, 67 

Parameters in F, number of, 75-76 
Plane, equation of, 35 
Plotting trait configuration on a sphere, 
164-65, 167-68 



Polynomial, roots of, 251-52 
Population, statistical, 48 
Population matrix 

for common factors, 58-59 

for error factors, 56, 58-59 

geometrical interpretation of, 68-69 

for specific factors, 58-59 
Population matrix P1 

column of, 55-56 

common factors in, 56, 58-59 

components P1, P2, P3, 58-59 

elements of, 56, 58 

interpretation of cell entries, 55-56, 58 

orthogonal by rows, 61, 65 

relation to score matrix, 54-60 
Population space, 68-69 
Position sign of cell of a determinant, 3-4 
Positive co-ordinate hyperplane, 200 
Positive-definite matrix, 10 
Positive hyperplane 

definition of, 166 

method of determining a, 202-5 
Positive manifold, 199-212 
Positive manifold, oblique 

definition of, 200 

methods of determining, 201-2 
Positive manifold, orthogonal 

definition of, 200 

graphical solution for, 201 
Positive region, 166 
Positive simple structure 

definition of, 166 

precaution for, 182, 185, 200 
Postmultiplication, 19-20 
Postulate, fundamental, 50 
Postulated primary trait, verification of, 171-77 
Precaution for positive simple structure, 182, 185, 200 
Premultiplication, 19-20 

Primary ability, see Primary traits 

definition of, 51 

in scientific constructs, 52-53, 55, 73, 75 
Primary causes 

and composite traits, 206 

in ideal constructs, 47 
Primary factor, see Primary traits 
Primary traits 

appraisal for each individual, 226-28 

in Brigham's data, 167-70, 172-74, 
176-77 

concepts for isolation of, 153 

and constellations, 174-75 

convincingness of, 161-62 

correlation between, 160 

definition of, 157 
Primary traits continued 

in estimation of s/i, 228-31 

fundamental criterion for, 55 

geometrical interpretation of, 157-61 

methods of isolating, 163-98, see Isola- 
tion of simple structure 

number of, 75 

postulated, verification of, 171-77 

and principal axes method, 171-74 

and rank two, 177-79 

relation to entries of V, 157 

and test battery, 55, 120 

theory of, 150-63 
Primary trait vector 

definition of, 157 

direction cosines of, 160 

relation to normal, 157, 158 

relation to oblique reference vectors, 

159-60 
Primary vector, 157-60 

Principal axes 

for Brigham's data, 132-33 

characteristic equation for, 123 

correlations between, 128 

definition of, 120 

and diagonal entries of R, 129 

and Gramian properties of R, 129 

Hotelling's special case of, 129-32 

in isolation of primary traits, 171-74 

major, 124, 126-27 

mean, 124, 127 

method of, 120-33 

minor, 124, 127 

modified form of, 171-74 

number of, 120 

numerical examples of, 124-29, 132- 
33 

orthogonal, 120, 123-24 

projection of traits on, 121, 128, 132 

psychological limitations of, 120-21 

and test battery, 120 

theory of, 120-24 

transformation from centroid axes to, 

123-24 

Principal diagonal, 3 
Principal minor, 6 

Product of 

factorial matrix and its transpose, 70 

factorial matrix and population matrix, 

54-60 
Product of matrices 

determinant of, 20 

inverse of, 25 

rank of, 20 

transpose of, 16 
Projection of 

unit trait vectors into an ellipse, 178-80 

unit trait vectors into a 
178-79 



Projection of traits 

on first centroid axis, 94 

on principal axes, 121, 128, 132 

on second centroid axis, 97 

on single-factor axis, 139-40, 146-47 

sum of squares stationary, 120-24 

Projection of vector j on unit reference 
vector, 121 

Psychologically meaningful factorial solu- 
tion, 74-77 

Psychology, science of, 45 

Rank 

of correlational matrix, 56, 95, 102-3 

of correlational matrix R, 72, 102-3 

and diagonal entries, 102-3 

of factorial matrix F, 72 

of first-factor residual table, 95 

in Hotelling's solution, 129-30 

of a matrix, 10, 33, 81-85 

of matrix product, 20 

of matrix R0, 72-73 

notation for, 56 

of residual table, 97 
Rank n 

factor method for, 78-81 

limitations of factorial solution for, 77- 

78 
Rank one, 134-49 

graphical method for, 136-37 

second-order minors for, 134, 137-39 
Rank two, 177-79 
Rational equation 

convincingness of, 45 

and correlation coefficient, 206 
Reality, physical, 44 
Reciprocal of a matrix, 25 
Rectangular notation for equations, 14-15, 

17-18 
Reduced correlational matrix R 

definition of, 66 

diagonal entry of, 66 

geometrical interpretation of, 69 

relation to factorial matrix, 70 

relation to matrix R0, 73 
Reference abilities 

definition of, 51 

number of, 75 

in scientific formulation, 46-48, 52-53, 

55, 73, 75 

Reference traits, 48 
Reference vectors 

centroid, 151 

correlation with traits, 121 

and descriptive categories, 153 

direction cosines of, 121-24, 126-27, 
132, 154-55, 159-60, 168, 177, 182, 
184-85, 189-94, 106 
Reference vectors continued 

geometrical interpretation of, 121 

oblique, see Oblique reference vectors 

and trait complexity, 155 

unit, 122 
Reflection of traits 

criteria for, 96-97 

definition of, 95 

geometrical interpretation of, 95 

sign position for, 107, 111 
Region, positive, 166 
Regression s on x, 228-31 
Regression x on s, 226-28 
Reliability coefficient, 66-68 
Residuals 

of first centroid factor, 94-95 

geometrical interpretation of, 95-96 

sign position for, 104, 107, 111 

size of, 200-201 

sum of column after reflection, 97 

sum after reflection, 97 

sum before reflection, 95 
Residual table, rank of, 97 
Restrictions on factorial matrix, 199 
Roots of 

characteristic equation, 123-24, 126, 132 

polynomial, 251-52 

Rotation of axes 

from centroid to principal axes, 120-33 

communalities after, 128 

and elements of F, 150 

and entries of factorial matrix, 150 

in four dimensions, 222-25 

in three dimensions, 213-21 

Sampling errors in 

matrix R0, 72 

Spearman's methods, 135-37, 139-40 
Scalar, 20, 23 
Scalar matrix, 21 
Scalar product of two vectors, 37 
Science 

nature of, 44-48 

of psychology, 45 
Scientific law, 44-45, 47 
Score 

definition of, 48 

standard, 49-50, 52-53, 60 

true, 48-50 
Score matrix 

definition of, 54 

element of, 60 

factors in, 60 

geometrical interpretation of, 69 

in matrix notation, 58-60 

order of, 54 

as product of two matrices, 54, 57-58, 60 



relation to complete correlational matrix R1, 64-65, 70 

relation to factorial matrix F1, 54-60 

relation to moment matrix, 64 

relation to population matrix, 54-60 
Second-factor loading, centroid, 97 
Secondary diagonal, 3 
Sectioning of a matrix 

in estimation of communalities, 90-91 

method of, 83-85 
Self correlation, factors in, 66 
Signs in factorial matrix, 71-72 
Sign-changing methods 

comparison of, 107-8 

in numerical examples, 98-118 
Sign position for 

reflection of traits, 107, 111 

residuals, 104, 107, 111 
Sign reversal 

in column of F, 71 

criteria for, 96-97 

in row of F, 72, 95 

Sign-reversing method 

arithmetical check for, 107 
in numerical examples, 98-118 
sign table for, 104-5 
summary of procedure for, 106-7 

Sign table, 104-5 

Simple order 

definition of, 150, 152-53 
orthogonal, 153 

Simple structure 

in Brigham's data, 167-70, 172-74, 
176-77 

concepts of, 150-63 

criteria for, 163, 185-89 

diagram of a, 157-61 

geometrical interpretation of, 157-61 

methods of isolating, 163-98, see Isola- 
tion of simple structure 

oblique, see Oblique simple structure 

orthogonal, 151, 153 

positive, 166, 182, 185, 200 

subspaces in, 154, 157, 159, 161 

uniqueness of, 155-56 
Simultaneous equations 

in λ_mp and β_p, 123 
Single common factor, 134-49 
Single-factor method 

with tetrads, 136-44 

without tetrads, 144-49 
Single-hyperplane method 

corrections for trial direction cosines, 
184-85 

facilitating tables for, 185-86 

of isolating primary traits, 179-85 

transformation for, 185 
Singular matrix, 11 

Skew symmetric form of infinitesimal 

orthogonal transformations, 219 
Skew symmetric matrix, 10 
Space 

common-factor, 69 

population, 68-69 

total factor, 69 
Spearman's single-factor methods 

formula r ao , 88, 139-40, 146-47 

matrix formulation of, 134-49 
Specific abilities, 54-63, 67 
Specific factors in 

complete correlational matrix R1, 65-66 

factorial matrix F1, 57, 59 

moment matrix, 64-65 

population matrix, 56, 58-59 

score matrix, 60 

self correlation, 66 
Specific variance, 54-55, 62-63, 68 
Specificity 

definition of, 62 

experimental estimate, 68 

in reliability coefficient, 67-68 
Spherical triangle, 165, 168, 175 
Square root on the calculating machine, 253 
Standard error of tetrad difference, 140 
Standard scores 

definition of, 49-50 

linear form of, 52-53, 60 

matrix of, 54-60 

normalized, 49 

in primary abilities (or traits), 226-28 

sum for population, 60 

sum of squares for population, 60-61 

in a test, estimated from standard score 

in a primary ability, 228-31 
Statistical independence, 50, 76 
Statistical population, 48 

Structure 

definition of, 151, 153 

oblique simple, see Oblique simple 
structure 

orthogonal simple, 151, 153 

positive simple, 166, 182, 185, 200 

simple, see Simple structure 
Subspaces, 154, 157, 159, 161 
Successive approximation methods, 183-85, 189-97 

Sum of two matrices, 19 
Summational notation 

for equations, 15-16, 28-31 

in matrix multiplication, 29-30 
Symmetric matrix 

centroid method of factoring, 92-119 

definition of, 10 

diagonal method of factoring, 78-81 



Tests 

composite, 51 

correlation between, 64-66 

definition of, 48 

geometrical interpretation, 69 

image of, 95 

parallel, 67 

reflection of, 95 

weights, 52, 55, 62 
Test battery and 

common factors, 54 

primary abilities, 55, 120 

principal axes, 120 
Test coefficients, 55, 62 
Test intercorrelations, positive, 166 
Test performance 

fundamental equation for, 52 

negative loading in, 120-21 
Tetrad difference 

definition of, 137 

graphical analysis of, 141-44 

matrix interpretation of, 137 

and mental ability, 140-41 

number of, for n tests, 138-39 

standard error of, 140 
Theorem, fundamental factor, 70 
Total factor space, 69 
Total variance, 54, 60-63, 68 
Traits 

centroid co-ordinates of, 94 

cluster of, 96 

complexity of, 155 

and concepts of simple structure, 153 

coplanar for rank two, 177-79 

correlation between, 64, 66 

correlation with reference vectors, 121 

correlation with single common factor, 
139-40, 146-47 

definition of, 48 

image of, 95 

imaginary pure, 121 

in a cluster, 96 

primary, see Primary traits 

reference, 48 

reflection of, 95-97 

unitary, 205-12 
Trait configuration 

co-ordinate hyperplanes for, 157 

definition of, 151, 153 

plotted on a sphere, 164-65, 167-68 
Trait projections 

on centroid axes, 94, 97 

on principal axes, 121, 128, 132 

on single-factor axis, 139-40, 146-47 

sum of squares stationary, 120-24 
Trait vector 

augmented co-ordinates of, 158 

primary, see Primary trait vectors 

projection on reference vector, 121 
Transformation 

from centroid axes to principal axes, 

123-24 

infinitesimal orthogonal, 219 
linear, 38 
matrix of, 38 
oblique, 43 
orthogonal, 38-43, 213-25 

Transformation G, 154-55, 159-60, 185 
Transformation L, 152, 154 

Transpose of 
a matrix, 3 
product of matrices, 15 

Trial vectors, corrections for 
in analytical method, 189-94 
in single-hyperplane method, 184-85 

True intercorrelations in 
matrix R1, 65-66 
matrix R, 72 

True score, 48-50 

Unique abilities, 55, 62-64 
Unique configuration, 71, 74-77 
Unique factorial solution, 71, 74-77 
Unique variance in R0, 78 

Uniqueness 
algebraic, 73 
configurational, 74-77 
correction for, 118-19 
definition of, 63 
notation for, 68 
of simple structure, 155-56 

Unit reference vector 

conditional equation for, 122 
definition of, 121 



Unitary ability, definition of, 51 
Unitary factors, 205-12 

Variance of a test 

common-factor, 54, 62-63, 68 

error, 54, 62-63, 68 

factors in, 54-55, 62-63, 68 

specific, 54-55, 62-63, 68 

in terms of communality and unique- 
ness, 63 

total, 54, 60-63, 68 

uniqueness in, 63 
Vector 

in a matrix, 14 

oblique reference, see Oblique reference 
vectors 

orthogonal reference, see Orthogonal 
reference vectors 

primary, 157-60 

primary trait, see Primary trait vector 

reference, see Reference vector 

Weights 

invariant, 55 

for a test, 52, 55, 62 
Wilson's orthogonal transformation, 221 

Zero entries in V 

function for maximizing number of, 
181-82 

geometrical interpretation of, 180-81 
Zero factor loadings 

function for maximizing number of, 
181-82 

geometrical interpretation of, 180-81 

maximizing number of, 179-85 
Zero root of characteristic equation, 126-29 
Zeros in row of factorial matrix, 150-52 